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S/N Unknown PATENT 
TN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: ShoheiKoide Examiner: Unknown 

Serial No.: Unknown Group Art Unit: Unknown 

Filed: Herewith Docket: 109.034US3 

Title: ARTIFICIAL ANTIBODY POLYPEPTIDES 



PRELIMINARY AME NDM E NT 

BOX PATENT APPLICATION 
Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Prior to examination, please amend the above-identified divisional application as follows. 



In the Specification 

Please insert the enclosed SEQUENCE LISTING into the specification. 

On page 1, after the title, please insert the following: —This application is a divisional of U.S. 

Serial No. 09/096,749, filed June 12, 1998, which claims priority under 35 U.S.C. § 1 19(e) to 

provisional application U.S. Serial No. 60/049,410, filed June 12, 1997. These applications are 

incorporated herein by reference- 
On page 12, line 13 after "Amino acid sequence" insert - (SEQ ID NO: 110)--. 
On page 12, line 18 before ", preceding" insert ~ (SEQ ID NO:l 14)~. 
On page 13, line 5 after "sequences" insert ~ (SEQ ID NO:l 1 1 and SEQ ID NO:l 12, 

respectively)--. 

On page 14, line 5 after "pLB25.1" insert - (containing SEQ IDNO:115)- and after 
"pLB25.4" insert -- (containing SEQ ID NO:l 16)-. 

On page 14, line 6 after "pLB24.1" insert - (containing SEQ ID NO:l 17)- and after 
"pLB24.3" insert - (containing SEQ ID NO: 11 8)-. 

On page 18, line 16 after "(RGD)" insert - (SEQ ID NO:l 13) -. 
On page 21, please make the following changes: 

at the end of line 9 insert - (SEQ ID NO:l)~; 

at the end of line 10 insert - (SEQ ID NO:2)-; 

at the end of line 12 insert - (SEQ ID NO:3)~; 

at the end of line 13 insert ~ (SEQ ID NO:4)~; 
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at the end of line 14 insert 


(SEQ ID NO:5)-s 


at the end of line 15 insert 


« (SEQ IDNO:6)-; 


at the end of line 16 insert 


« (SEQ IDNO:7)-s 


at the end of line 17 insert 


~ (SEQ IDNO:8)~; 


at the end of line 18 insert 


- (SEQ IDNO:9)~; 


at the end of line 19 insert 


« (SEQ ID NO: 10)--; 


at the end of line 20 insert 


(SEQIDNO:ll)~; 


at the end of line 21 insert 


- (SEQ ID NO: 12)--. 


On page 25 please make the following changes: 


at the end of line 7 insert - 


(SEQ ID NO: 13)-; 


at the end of line 8 insert - 


■(SEQ ID NO: 14)--; 


at the end of line 9 insert - 


(SEQ ID NO: 15)-; 


at the end of line 10 insert 


-- (SEQ ID NO: 16)--; 


at the end of line 1 1 insert 


» (SEQ ID NO: 17)-; 


at the end of line 12 insert 


- (SEQ ID NO: 18)--; 


at the end of line 13 insert 


-- (SEQ IDNO:19)~; 


at the end of line 14 insert 


-- (SEQ IDNO:20)~; 


at the end of line 15 insert 


- (SEQIDNO:21)~; 


at the end of line 16 insert 


- (SEQ ID NO:22)~; 


at the end of line 17 insert 


-- (SEQ ID NO:23)~; 


at the end of line 18 insert 


(SEQ ID NO:24)~; 


at the end of line 19 insert 


-- (SEQ ID NO:25)~; 


at the end of line 20 insert 


- (SEQ ID NO:26)«; 


at the end of line 21 insert 


- (SEQ ID NO:27)-; 


at the end of line 22 insert 


-- (SEQ ID NO:28)~; 


at the end of line 23 insert 


- (SEQ ID NO:29)-; 


at the end of line 24 insert 


- (SEQ ID NO:30)~. 


On page 43: 





line 14, after "CARRA" insert - (SEQ ID NO:31)-, and after "RWIPLAK" insert 
(SEQIDNO:32)~; 
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line 15, after "CWRRA" insert - (SEQ ID NO:33)~, and after "RWVGLAW" insert - 

- (SEQ ID NO:34)~ ; 

line 16, after "CKHRR" insert -- (SEQ ID NO:35)~, and after "FADLWWR" insert - 
(SEQ ID NO:36)-; 

line 17, after "CRRGR insert - (SEQ ID NO:37)--", and after "RGFMWLS" insert -- 
(SEQ ID NO:38)~ ; 

line 18, after "CNWRR" insert - (SEQ ID NO:39)-, and after "RAYRYRW" insert - 

- (SEQ ID NO:40)~ ; 

line 19, after "SRLRR" insert - (SEQ ID NO:41)-, and after "PPWRV" insert -- 
(SEQ ID NO: 42)--; 

line 20, after "ARWTL" insert -- (SEQ ID NO:43)~, and after "RRWWW" insert - 
(SEQ ID NO:44)-. 

line 21, after "GQRTF" insert -- (SEQ ID NO:45)--, and after "RRWWA" insert - 
(SEQ ID NO:46)~. 
On page 45: 

line 4, after "AVTVR" insert -- (SEQ ID NO:47)--, and after "RGDSPAS" insert - 
(SEQIDNO:48)-; 

line 6, after "CNWRR" insert - (SEQ ID NO:49)«, and after "RAYRYRW" insert -- 
(SEQ ID NO:50)-; 

line 7, after "CMWRA" insert -- (SEQ ID NO:51)~, and after "RWGMLRR" insert - 
(SEQIDNO:52)~; 

line 8, after "ARMRE" insert - (SEQ LD NO:53)-, and after "RWLRGRY" insert - 
(SEQ ID NO:54)«; 

line 9, after "CARRR" insert -- (SEQ ID NO:55)-, and after "RRAGWGW" insert - 
(SEQ ID NO:56)--; 

line 10, after "CNWRR" insert - (SEQ ID NO:57)-, and after "RAYRYRW" insert - 
- (SEQ ID NO:58)- ; 

line 11, after "RWRER" insert (SEQ ID NO:59)~, and after "RHPWTER" insert - 
(SEQ ID NO:60)--; 

line 12, after "CNWRR" insert - (SEQ ID NO:61)~, and after "RAYRYRW" insert - 
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- (SEQ ID NO:62)~ ; 

line 13, after "ERRVP" insert -- (SEQ ID NO:63)--, and after "RLLLWQR" insert - 
(SEQIDNO:64)-; 

line 14, after "GRGAG" insert - (SEQ ID NO:65)--, and after "FGSFERR" insert -- 
(SEQ ID NO:66)--; 

line 15, after "CRWTR" insert - (SEQ ID NO:67)-, and after "RRWFDGA" insert - 

- (SEQ ID NO:68)--; 

line 16, after "CNWRR" insert -- (SEQ ID NO:69)--, and after "RAYRYRW" insert 

- (SEQ ID NO:70)--; 

line 20, after "AVTVR" insert -- (SEQ ID NO:71)~, and after "GRGDS" insert -- 
(SEQ ID NO:72)- ; 

line 22, after "GQRTF" insert -- (SEQ ID NO:73)--, and after "RRWWA" insert -- 
(SEQ ID NO:74)-; 

line 23, after "GQRTF" insert - (SEQ ID NO:75)--, and after "RRWWA" insert -- 
(SEQIDNO:76)~; 

line 24, after "GQRTF" insert -- (SEQ ID NO:77)-, and after "RRWWA" insert -- 
(SEQ ID NO:78)~; 

line 25, after "LRYRS" insert - (SEQ ID NO:79)-, and after "GWRWR" insert - 
(SEQ ID NO:80)--; 

line 26, after "GQRTF insert - (SEQ ID NO:81)--", and after "RRWWA" insert - 
(SEQIDNO:82)--; 

line 27, after "GQRTF" insert -- (SEQ ID NO:83)-, and after "RRWWA" insert - 
(SEQ ID NO:84)~; 

line 28, after "LRYRS" insert -- (SEQ ID NO:85)--, and after "GWRWR" insert -- 
(SEQ ID NO:86)-; 

line 29, after "LRYRS" insert - (SEQ ID NO:87)~, and after "GWRWR" insert - 
(SEQ ID NO:88)-; 

line 30, after "GQRTF" insert -- (SEQ ID NO: 89)--, and after "RRWWA" insert ~ 
(SEQ ID NO:90)~; 

line 31, after "LRYRS" insert - (SEQ ID NO:91)-, and after "GWRWR" insert - 
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(SEQIDNO:92)--. 
On page 52, Table 6: 

after "GCAGTTACCGTGCGT" insert -- (SEQ ID NO:93)», and after 
"GGCCGTGGTGACAGCCCAGCGAGC" insert -- (SEQ ID NO:95)~; 
after "AlaValThrValArg" insert - (SEQ ID NO:94)-, and after 
"GlyArgGlyAspSerProAlaSer" insert -- (SEQ ID NO:96)~ ; 
after "TCGAGGTTGCGGCGG" insert - (SEQ ID NO:97)--, and after 
"CCGCCGTGGAGGGTG" insert -- (SEQ ID NO:99); 

after "SerArgLeuArgArg" insert - (SEQ ID NO:98)~, and after "ProProTrpArgVal" 
insert- (SEQ ID NO: 100)-- ; 

after "GGTCAGCGAACTTTT" insert ~ (SEQ ID NO: 101)-, and after 
"AGGCGGTGGTGGGCT" insert ~ (SEQ ID NO: 103); 
after "GlyGlnArgThrPhe" insert - (SEQ ID NO: 102)-, and after 
"ArgArgTrpTrpAla" insert - (SEQ ID NO: 104)-- ; 
after "GCGAGGTGGACGCTT" insert - (SEQ ID NO: 105)-, and after 
"AGGCGGTGGTGGTGG" insert - (SEQ ID NO: 107)-; 
after "AlaArgTrpThrLeu" insert - (SEQ ID NO: 106)-, and after 
"ArgArgTrpTrpTrp" insert - (SEQ ID NO: 108)- . 
On page 54, line 5, after "GKKGK" insert - (SEQ ID NO: 109) --. 



Tn the Claims 

Please cancel claims 1-14, 30-35, and 38-39. 



Tn the Drawing s 

Figure 2 is amended by changing the abbreviation for the amino acid at position 46 from "O" 
to "Q". Enclosed is a copy of sheet 2 of the drawings showing the amendment to Figure 2 in red ink. 
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Remarks 

Claims 1-14, 30-35, and 38-39 have been canceled. Claims 15-29, 36 and 37 remain 
pending. 

This Preliminary Amendment is submitted in part to conform the above-referenced 
application to the requirements of 37 C.F.R. §§ 1.821 through 1.825. 

Applicant's Representatives recently discovered that Figure 2 of the Formal Drawings, filed 
with the application, contains a typographical error. In the first line of the sequence, the "O" at 
position 46 should be "Q" to represent a glutamine residue. As indicated on page 1 1, line 25 through 
page 12, line 2, "O" is not one of the abbreviations set forth whereas "Q" is. Further, amino acid 
residue number 46 of Figure 5 (SEQ ID NO:l 1 1), which corresponds to amino acid residue number 
46 of Figure 2, is clearly glutamine. This error will be corrected in the Formal Drawings to be 
submitted after receipt of a Notice of Allowance. 

The Commissioner is hereby authorized to charge any additional fees to Deposit 
Account No. 19-0743, 

When this application is taken up for the first Office Action, Applicant respectfully requests 
that the Examiner consider the Preliminary Amendment and remarks presented herein. 

Respectfully submitted, 

SHOHEI KOIDE 

By his Representatives, 

SCHWEGMAN, LUNDBERG, WOESSNER & KLUTH, P.A. 
P.O. Box 2938 
Minneapolis, MN 55402 
(612) 373-6961 



Date // fM^A^f Oc&o By 




Ann S. Viksnins 
Reg. No. 37,748 



Express Mail No.: KT .7093059751 JS Mailing Date: August 1 1 , 2000 

I hereby certify that this paper or fee is being deposited with the United States Postal Service "ExweSS Mail Post Office to Addressee" service under 
37 C.F.R. 1 . 10 on the date indicated above and is addressed to Box PATENT APPLICATION, AsadsUat Commissioner for Patents, Washington, 



D.C. 20231. 
Shawn T„ Hise 



Printed Name 



Signature 
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ARTIFICIAL ANTIBODY POLYPEPTIDES 

FIELD OF THE INVENTION 

The present invention relates generally to the field of the production and 
5 selection of binding and catalytic polypeptides by the methods of molecular biology, 
using both combinatorial chemistry and recombinant DNA. The invention 
specifically relates to the generation of both nucleic acid and polypeptide libraries 
derived therefrom encoding the molecular scaffolding of Fibronectin Type III (Fn3) 
modified in one or more of its loop regions. The invention also relates to the 
10 "artificial mini-antibodies" or "monobodies," i.e., the polypeptides comprising an 
Fn3 scaffold onto which loop regions capable of binding to a variety of different 
molecular structures (such as antibody binding sites) have been grafted. 

BACKGROUND OF THE INVENTION 

15 Antihodv structure 

A standard antibody (Ab) is a tetrameric structure consisting of two identical 
immunoglobulin (Ig) heavy chains and two identical light chains. The heavy and 
light chains of an Ab consist of different domains. Each light chain has one variable 
domain (VL) and one constant domain (CL), while each heavy chain has one 

20 variable domain (VH) and three or four constant domains (CH) (Alzari et al„ 1988). 
Each domain, consisting of ~ 1 10 amino acid residues, is folded into a characteristic 
p-sandwich structure formed from two p-sheets packed against each other, the 
immunoglobulin fold. The VH and VL domains each have three complementarity 
determining regions (CDR1-3) that are loops, or turns, connecting p-strands at one 

25 end of the domains (Fig. 1 : A, C). The variable regions of both the light and heavy 
chains generally contribute to antigen specificity, although the contribution of the 
individual chains to specificity is not always equal. Antibody molecules have 
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evolved to bind to a large number of molecules by using six randomized loops 
(CDRs). However, the size of the antibodies and the complexity of six loops 
represents a major design hurdle if the end result is to be a relatively small peptide 
ligand. 

5 

Antibody substructures 

Functional substructures of Abs can be prepared by proteolysis and by 
recombinant methods. They include the Fab fragment, which comprises the VH- 
CH1 domains of the heavy chain and the VL-CL1 domains of the light chain joined 

10 by a single interchain disulfide bond, and the Fv fragment, which comprises only the 
VH and VL domains. In some cases, a single VH domain retains significant affinity 
(Ward et aL, 1989). It has also been shown that a certain monomelic k light chain 
will specifically bind to its cognate antigen. (L. Masat et ah, 1994), Separated light 
or heavy chains have sometimes been found to retain some antigen-binding activity 

15 (Ward et aL, 1989). These antibody fragments are not suitable for structural analysis 
using NMR spectroscopy due to their size, low solubility or low conformational 
stability. 

Another functional substructure is a single chain Fv (scFv), comprised of the 
variable regions of the immunoglobulin heavy and light chain, covalently connected 

20 by a peptide linker (S-z Hu et aL, 1996). These small (M r 25,000) proteins generally 
retain specificity and affinity for antigen in a single polypeptide and can provide a 
convenient building block for larger, antigen-specific molecules. Several groups 
have reported biodistribution studies in xenografted athymic mice using scFv 
reactive against a variety of tumor antigens, in which specific tumor localization has 

25 been observed. However, the short persistence of scFvs in the circulation limits the 
exposure of tumor cells to the scFvs, placing limits on the level of uptake. As a 
result, tumor uptake by scFvs in animal studies has generally been only l-5%ID/g as 
opposed to intact antibodies that can localize in tumors ad 30-40 %H)/g and have 
reached levels as high as 60-70 %E)/g. 
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A small protein scaffold called a "minibody" was designed using a part of 
the Ig VH domain as the template (Pessi et aL, 1993). Minibodies with high affinity 
(dissociation constant (K d ) ~ 10" 7 M) to interleukin-6 were identified by randomizing 
loops corresponding to CDR1 and CDR2 of VH and then selecting mutants using 
5 the phage display method (Martin et aL, 1994). These experiments demonstrated 
that the essence of the Ab function could be transferred to a smaller system* 
However, the minibody had inherited the limited solubility of the VH domain 
(Bianchi et aL, 1994). 

It has been reported that camels (Camelus dromedarius) often lack variable 

10 light chain domains when IgG-like material from their serum is analyzed, suggesting 
that sufficient antibody specificity and affinity can be derived form VH domains 
(three CDR loops) alone. Davies and Riechmann recently demonstrated that 
"cameiized" VH domains with high affinity (K d ~ 10 7 M) and high specificity can 
be generated by randomizing only the CDR3. To improve the solubility and 

15 suppress nonspecific binding, three mutations were introduced to the framework 
region (Davies & Riechmann, 1995). It has not been definitively shown, however, 
that camelization can be used, in general, to improve the solubility and stability of 
VHs. 

An alternative to the "minibody" is the "diabody." Diabodies are small 
20 bivalent and bispecific antibody fragments, i.e., they have two antigen-binding sites. 
The fragments comprise a heavy-chain variable domain (V H ) connected to a light- 
chain variable domain Qf{) on the same polypeptide chain (V^Vl). Diabodies are 
similar in size to an Fab fragment By using a linker that is too short to allow 
pairing between the two domains on the same chain, the domains are forced to pair 
25 with the complementary domains of another chain and create two antigen-binding 
sites. These dimeric antibody fragments, or "diabodies," are bivalent and bispecific. 
P. Holliger et aL, PNAS 90:6444-6448 (1993). 

Since the development of the monoclonal antibody technology, a large 
number of 3D structures of Ab fragments in the complexed and/or free states have 
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been solved by X-ray crystallography (Webster et al., 1994; Wilson & Stanfield, 
1994). Analysis of Ab structures has revealed that five out of the six CDRs have 
limited numbers of peptide backbone conformations, thereby permitting one to 
predict the backbone conformation of CDRs using the so-called canonical structures 
5 (Lesk & Tramontane 1992; Rees et al., 1994). The analysis also has revealed that 
the CDR3 of the VH domain (VH-CDR3) usually has the largest contact surface and 
that its conformation is too diverse for canonical structures to be defined; VH-CDR3 
is also known to have a large variation in length (Wu et al., 1993). Therefore, the 
structures of crucial regions of the Ab-antigen interface still need to be 

1 0 experimentally determined. 

Comparison of crystal structures between the free and complexed states has 
revealed several types of conformational rearrangements. They include side-chain 
rearrangements, segmental movements, large rearrangements of VH-CDR3 and 
changes in the relative position of the VH and VL domains (Wilson & Stanfield, 

15 1993). In the free state, CDRs, in particular those which undergo large 

conformational changes upon binding, are expected to be flexible. Since X-ray 
crystallography is not suited for characterizing flexible parts of molecules, structural 
studies in the solution state have not been possible to provide dynamic pictures of 
the conformation of antigen-binding sites. 

20 

Mimicking the antibody-binding site 

CDR peptides and organic CDR mimetics have been made (Dougall et aL, 
1994). CDR peptides are short, typically cyclic, peptides which correspond to the 
amino acid sequences of CDR loops of antibodies. CDR loops are responsible for 
25 antibody-antigen interactions. Organic CDR mimetics are peptides corresponding to 
CDR loops which are attached to a scaffold, e.g., a small organic compound. 

CDR peptides and organic CDR mimetics have been shown to retain some 
binding affinity (Smyth & von Itzstein, 1994). However, as expected, they are too 
small and too flexible to maintain full affinity and specificity. Mouse CDRs have 
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been grafted onto the human Ig framework without the loss of affinity (Jones et aL, 
1986; Riechmann et aL, 1988), though this "humanization" does not solve the 
above-mentioned problems specific to solution studies. 

5 Mimicking natural selection processes of Abs 

In the immune system, specific Abs are selected and amplified from a large 
library (affinity maturation). The processes can be reproduced in vitro using 
combinatorial library technologies. The successful display of Ab fragments on the 
surface of bacteriophage has made it possible to generate and screen a vast number 

10 of CDR mutations (McCafferty et aL, 1990; Barbas et aL, 1991; Winter et aL, 1994). 
An increasing number of Fabs and Fvs (and their derivatives) is produced by this 
technique, providing a rich source for structural studies. The combinatorial 
technique can be combined with Ab mimics. 

A number of protein domains that could potentially serve as protein scaffolds 

15 have been expressed as fusions with phage capsid proteins. Review in Clackson & 
Wells, Trends Biotechnol. 12:173-184 (1994). Indeed, several of these protein 
domains have already been used as scaffolds for displaying random peptide 
sequences, including bovine pancreatic trypsin inhibitor (Roberts et aL, PNAS 
89:2429-2433 (1992)), human growth hormone (Lowman et aL, Biochemistry 

20 30: 1083240838 (1991)), Venturini et aL, Protein Peptide Letters 1 :70-75 (1994)), 
and the IgG binding domain of Streptococcus (O'Neil et aL, Techniques in Protein 
Chemistry V (Crabb, L,. ed.) pp. 517-524, Academic Press, San Diego (1994)). 
These scaffolds have displayed a single randomized loop or region. 

Researchers have used the small 74 amino acid cc-amylase inhibitor 

25 Tendamistat as a presentation scaffold on the filamentous phage M13 (McConnell 
and Hoess, 1995). Tendamistat is a p-sheet protein from Streptomyces tendae. It 
has a number of features that make it an attractive scaffold for peptides, including its 
small size, stability, and the availability of high resolution NMR and X-ray 
structural data. Tendamistat' s overall topology is similar to that of an 
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immunoglobulin domain, with two p-sheets connected by a series of loops. In 
contrast to immunoglobulin domains, the P-sheets of Tendamistat are held together 
with two rather than one disulfide bond, accounting for the considerable stability of 
the protein. By analogy with the CDR loops found in immunoglobulins, the loops 
5 the Tendamistat may serve a similar function and can be easily randomized by in 
vitro mutagenesis. 

Tendamistat, however, is derived from Streptomyces tendae. Thus, while 
Tendamistat may be antigenic in humans, its small size may reduce or inhibit its 
antigenicity. Also, Tendamistat' s stability is uncertain. Further, the stability that is 

10 reported for Tendamistat is attributed to the presence of two disulfide bonds. 

Disulfide bonds, however, are a significant disadvantage to such molecules in that 
they can be broken under reducing conditions and must be properly formed in order 
to have a useful protein structure. Further, the size of the loops in Tendamistat are 
relatively small, thus limiting the size of the inserts that can be accommodated in the 

15 scaffold. Moreover, it is well known that forming correct disulfide bonds in newly 
synthesized peptides is not straightforward. When a protein is expressed in the 
cytoplasmic space of E. coli, the most common host bacterium for protein 
overexpression, disulfide bonds are usually not formed, potentially making it 
difficult to prepare large quantities of engineered molecules. 

20 Thus, there is an on-going need for small, single-chain artificial antibodies 

for a variety of therapeutic, diagnostic and catalytic applications. 

SUMMARY OF THE INVENTION 

The invention provides a fibronectin type IE (Fn3) polypeptide monobody 
25 comprising a plurality of Fn3 P-strand domain sequences that are linked to a 
plurality of loop region sequences. One or more of the monobody loop region 
sequences of the Fn3 polypeptide vary by deletion, insertion or replacement of at 
least two amino acids from the corresponding loop region sequences in wild-type 
Fn3. The p-strand domains of the monobody have at least about 50% total amino 
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acid sequence homology to the corresponding amino acid sequence of wild-type 
Fn3*s p-strand domain sequences. Preferably, one or more of the loop regions of the 
monobody comprise amino acid residues: 

i) from 15 to 16 inclusive in an AB loop; 
5 ii) from 22 to 30 inclusive in a BC loop; 

iii) from 39 to 45 inclusive in a CD loop; 

iv) from 5 1 to 55 inclusive in a DE loop; 

v) from 60 to 66 inclusive in an EF loop; and 

vi) from 76 to 87 inclusive in an FG loop. 

10 The invention also provides a nucleic acid molecule encoding a Fn3 

polypeptide monobody of the invention, as well as an expression vector comprising 
said nucleic acid molecule and a host cell comprising said vector. 

The invention further provides a method of preparing a Fn3 polypeptide 
monobody. The method comprises providing a DNA sequence encoding a plurality 

15 of Fn3 P-strand domain sequences that are linked to a plurality of loop region 
sequences, wherein at least one loop region of said sequence contains a unique 
restriction enzyme site. The DNA sequence is cleaved at the unique restriction site. 
Then a preselected DNA segment is inserted into the restriction site. The 
preselected DNA segment encodes a peptide capable of binding to a specific binding 

20 partner (SBP) or a transition state analog compound (TSAC). The insertion of the 
preselected DNA segment into the DNA sequence yields a DNA molecule which 
encodes a polypeptide monobody having an insertion. The DNA molecule is then 
expressed so as to yield the polypeptide monobody. 

Also provided is a method of preparing a Fn3 polypeptide monobody, which 

25 method comprises providing a replicatable DNA sequence encoding a plurality of 
Fn3 P-strand domain sequences that are linked to a plurality of loop region 
sequences, wherein the nucleotide sequence of at least one loop region is known. 
Polymerase chain reaction (PCR) primers are provided or prepared which are 
sufficiently complementary to the known loop sequence so as to be hybridizable 
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under PCR conditions, wherein at least one of the primers contains a modified 
nucleic acid sequence to be inserted into the DNA sequence. PCR is performed 
using the replicatable DNA sequence and the primers. The reaction product of the 
PCR is then expressed so as to yield a polypeptide monobody. 
5 The invention further provides a method of preparing a Fn3 polypeptide 

monobody. The method comprises providing a replicatable DNA sequence 
encoding a plurality of Fn3 P-strand domain sequences that are linked to a plurality 
of loop region sequences, wherein the nucleotide sequence of at least one loop 
region is known. Site-directed mutagenesis of at least one loop region is performed 

10 so as to create an insertion mutation. The resultant DNA comprising the insertion 
mutation is then expressed. 

Further provided is a variegated nucleic acid library encoding Fn3 
polypeptide monobodies comprising a plurality of nucleic acid species encoding a 
plurality of Fn3 P-strand domain sequences that are linked to a plurality of loop 

15 region sequences, wherein one or more of the monobody loop region sequences vary 
by deletion, insertion or replacement of at least two amino acids from corresponding 
loop region sequences in wild-type Fn3, and wherein the p-strand domains of the 
monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequence of p-strand domain sequences of the wild-type 

20 Fn3. The invention also provides a peptide display library derived from the 
variegated nucleic acid library of the invention. Preferably, the peptide of the 
peptide display library is displayed on the surface of a bacteriophage, e.g., a M13 
bacteriophage or a fd bacteriophage, or virus. 

The invention also provides a method of identifying the amino acid sequence 

25 of a polypeptide molecule capable of binding to a specific binding partner (SBP) so 
as to form a polypeptide:SSP complex, wherein the dissociation constant of the said 
polypeptide:SBP complex is less than moles/liter. The method comprises the 
steps of: 

a) providing a peptide display library of the invention; 
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b) 



c) 



d) 



e) 



contacting the peptide display library of (a) with an immobilized or 
separable SBP; 

separating the peptidezSBP complexes from the free peptides; 
causing the replication of the separated peptides of (c) so as to result 
in a new peptide display library distinguished from that in (a) by 
having a lowered diversity and by being enriched in displayed 
peptides capable of binding the SBP; 

optionally repeating steps (b), (c), and (d) with the new library of (d); 



and 
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determining the nucleic acid sequence of the region encoding the 



displayed peptide of a species from (d) and hence deducing the 
peptide sequence capable of binding to the SBP. 
The present invention also provides a method of preparing a variegated 
nucleic acid library encoding Fn3 polypeptide monobodies having a plurality of 
15 nucleic acid species each comprising a plurality of loop regions, wherein the species 
encode a plurality of Fn3 p-strand domain sequences that are linked to a plurality of 
loop region sequences, wherein one or more of the loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from corresponding 
loop region sequences in wild-type Fn3, and wherein the P-strand domain sequences 
20 of the monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequences of p-strand domain sequences of the wild-type 
Fn3, comprising the steps of 



a) 



preparing an Fn3 polypeptide monobody having a predetermined 



sequence; 



25 



b) 



contacting the polypeptide with a specific binding partner (SBP) so 
as to form a polypeptide:SSP complex wherein the dissociation 
constant of the said polypeptide:SBP complex is less than 10" 6 
moIesAiter; 
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c) determining the binding structure of the po!ypeptide:SBP complex by 
nuclear magnetic resonance spectroscopy or X-ray crystallography; 
and 

d) preparing the variegated nucleic acid library, wherein the variegation 
5 is performed at positions in the nucleic acid sequence which, from 

the information provided in (c), result in one or more polypeptides 
with improved binding to the SBP. 
Also provided is a method of identifying the amino acid sequence of a 
polypeptide molecule capable of catalyzing a chemical reaction with a catalyzed rate 
10 constant, k^, and an uncatalyzed rate constant, k^^, such that the ratio of k ca /k iuical 
is greater than 10. The method comprises the steps of: 

a) providing a peptide display library of the invention; 

b) contacting the peptide display library of (a) with an immobilized or 
separable transition state analog compound (TS AC) representing the 

15 approximate molecular transition state of the chemical reaction; 

c) separating the peptide:TSAC complexes from the free peptides; 

d) causing the replication of the separated peptides of (c) so as to result 
in a new peptide display library distinguished from that in (a) by 
having a lowered diversity and by being enriched in displayed 

20 peptides capable of binding the TS AC; 

e) optionally repeating steps (b), (c), and (d) with the new library of (d); 
and 

f) determining the nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and hence deducing the 

25 peptide sequence. 

The invention also provides a method of preparing a variegated nucleic acid 
library encoding Fn3 polypeptide monobodies having a plurality of nucleic acid 
species each comprising a plurality of loop regions, wherein the species encode a 
plurality of Fn3 P-strand domain sequences that are linked to a plurality of loop 
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region sequences, wherein one or more of the loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from corresponding 
loop region sequences in wild-type Fn3, and wherein the P-strand domain sequences 
of the monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequences of P-strand domain sequences of the wild-type 
Fn3, comprising the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence, wherein the polypeptide is capable of catalyzing a 
chemical reaction with a catalyzed rate constant, k^, and an 
uncatalyzed rate constant, k^, such that the ratio of k^/k^ is 
greater than 10; 

b) contacting the polypeptide with an immobilized or separable 
transition state analog compound (TS AC) representing the 
approximate molecular transition state of the chemical reaction; 

c) determining the binding structure of the polypeptide:TS AC complex 
by nuclear magnetic resonance spectroscopy or X-ray 
crystallography; and 

d) preparing the variegated nucleic acid library, wherein the variegation 
is performed at positions in the nucleic acid sequence which, from 
the information provided in (c), result in one or more polypeptides 
with improved binding to or stabilization of the TS AC. 

The invention also provides a kit for the performance of any of the methods 
of the invention. The invention further provides a composition, e.g., a polypeptide, 
prepared by the use of the kit, or identified by any of the methods of the invention. 

The following abbreviations have been used in describing amino acids, 
peptides, or proteins: Ala, or A, Alanine; Arg, or R, Arginine; Asn or N, asparagine; 
Asp, or D, aspartic acid; Cysor C, cystein; Gin, or Q, glutamine; Glu, or E, glutamic 
acid; Gly, or G, glycine; His, or H, histidine; Be, or I, isoleucine; Leu, or L, leucine; 
Lys, or K, lysine; Met, or M, methionine; Phe, or F, phenylalanine; Pro, or P, 
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proline; Ser, or S, serine; Thr, or T, threonine; Tip, or W, tryptophan; Tyr, or Y, 
tyrosine; Val, or V, valine. 

The following abbreviations have been used in describing nucleic acids, 
DNA, or RNA: A, adenosine; T, thymidine; G, guanosine; C, cytosine. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. p-Strand and loop topology (A, B) and MOLSCREPT 
representation (C, D; Kraulis, 1991) of the VH domain of anti4ysozyme 
immunoglobulin D1.3 (A, C; Bhat et ah, 1994) and 10th type IH domain of human 

10 fibronectin (B, D; Main et al., 1992). The locations of complementarity determining 
regions (CDRs, hypervariable regions) and the integrin-binding Arg-Gly-Asp (RGD) 
sequence are indicated. 

Figure 2. Amino acid sequence and restriction sites of the synthetic Fn3 
gene. The residue numbering is according to Main et al. (1992). Restriction 

15 enzyme sites designed are shown above the amino acid sequence. P-Strands are 
denoted by underlines. The N-terminal "mq" sequence has been added for a 
subsequent cloning into an expression vector. The His*tag (Novagen) fusion protein 
has an additional sequence, MGSSHHHHHHSSGLVPRGSH, preceding the Fn3 
sequence shown above. 

20 Figure 3. A, Far UV CD spectra of wild-type Fn3 at 25°C and 90°C. Fn3 

(50 |iM) was dissolved in sodium acetate (50 mM ? pH 4.6). B, thermal denaturation 
of Fn3 monitored at 215 nm. Temperature was increased at a rate of 1 °C/min. 

Figure 4. A, Ca trace of the crystal structure of the complex of lysozyme 
(HEL) and the Fv fragment of the anti-hen egg-white lysozyme (anti-HEL) antibody 

25 DL3 (Bhat et aL, 1994). Side chains of the residues 99-102 of VH CDR3, which 
make contact with HEL, are also shown, B, Contact surface area for each residue of 
the D1.3 VH-HEL and VH-VL interactions plotted vs. residue number of D 1.3 VH. 
Surface area and secondary structure were determined using the program DSSP 
(Kabsh and Sander, 1983). C and D, schematic drawings of the p-sheet structure of 
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the F strand-loop-G strand moieties of D1.3 VH (C) and Fn3 (D). The boxes denote 
residues in P-strands and ovals those not in strands. The shaded boxes indicate 
residues of which side chains are significantly buried. The broken lines indicate 
hydrogen bonds. 

5 Figure 5. Designed Fn3 gene showing DNA and amino acid sequences. The 

amino acid numbering is according to Main et al. (1992). The two loops that were 
randomized in combinatorial libraries are enclosed in boxes. 

Figure 6. Map of plasmid pAS45. Plasmid pAS45 is the expression vector 
of His # tag-Fn3. 

10 Figure 7. Map of plasmid pAS25. Plasmid pAS25 is the expression vector 

ofFn3. 

Figure 8. Map of plasmid pAS38. pAS38 is a phagmid vector for the 
surface display of Fn3. 

Figure 9. (Ubiquitin-1) Characterization of ligand-specific binding of 

15 enriched clones using phage enzyme-linked immunosolvent assay (ELISA). 

Microtiter plate wells were coated with ubiquitin (1 fig/well; "Ligand (+)) and then 
blocked with BS A. Phage solution in TBS containing approximately 10 10 colony 
forming units (cfu) was added to a well and washed with TBS. Bound phages were 
detected with anti-phage antibody-POD conjugate (Pharmacia) with Turbo-TMB 

20 (Pierce) as a substrate. Absorbance was measured using a Molecular Devices 

SPECTRAmax 250 microplate spectrophotometer. For a control, wells without the 
immobilized ligand were used. 2-1 and 2-2 denote enriched clones from Library 2 
eluted with free ligand and acid, respectively. 4-1 and 4-2 denote enriched clones 
from Library 4 eluted with free ligand and acid, respectively. 

25 Figure 10. (Ubiquitin-2) Competition phage ELISA of enriched clones. 

Phage solutions containing approximately 10 10 cfu were first incubated with free 
ubiquitin at 4°C for 1 hour prior to the binding to a ligand-coated well. The wells 
were washed and phages detected as described above. 
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Figure 1 1 . Competition phage ELIS A of ubiquitin-binding monobody 411. 
Experimental conditions are the same as described above for ubiquitin. The ELIS A 
was performed in the presence of free ubiquitin in the binding solution. The 
experiments were performed with four different preparations of the same clone. 
5 Figure 12. (Fluorescein- 1) Phage ELISA of four clones, pLB25.1, pLB25.4, 

pLB24.1 and pLB24.3. Experimental conditions are the same as ubiquitin- 1 above. 

Figure 13. (FIuorescein-2) Competition ELISA of the four clones. 
Experimental conditions are the same as ubiquitin-2 above. 

Figure 14. i H, l5 N-HSQC spectrum of a fluorescence-binding monobody 
10 LB25.5. Approximately 20 jiM protein was dissolved in 10 mm sodium acetate 
buffer (pH 5.0) containing 100 mM sodium chloride. The spectrum was collected at 
30°C on a Varian Unity INOVA 600 NMR spectrometer. 

Figure 15. Characterization of the binding reaction of Ubi4-Fn3 to the 
target, ubiquitin. (a) Phage ELISA analysis of binding of Ubi4-Fn3 to ubiquitin. 
15 The binding of Ubi4-phages to ubiquitin-coated wells was measured. The control 
experiment was performed with wells containing no ubiquitin. 

(b) Competition phage ELISA of Ubi4-Fn3. Ubi4-Fn3-phages were 
preincubated with soluble ubiquitin at an indicated concentration, followed by the 
phage ELISA detection in ubiquitin-coated wells. 
20 (c) Competition phage ELISA testing the specificity of the Ubi4 clone. The 

Ubi4 phages were preincubated with 250 |ig/ml of soluble proteins, followed by 
phage ELISA as in (b). 

(d) ELISA using free proteins. 

Figure 16. Equilibrium unfolding curves for Ubi4-Fn3 (closed symbols) and 
25 wild-type Fn3 (open symbols). Squares indicate data measured in TBS (Tris HC1 
buffer (50 mM, pH 7.5) containing NaCl (150 mM)). Circles indicate data 
measured in Gly HC1 buffer (20 mM, pH 3.3) containing NaCl (300 mM). The 
curves show the best fit of the transition curve based on the two-state model. 
Parameters characterizing the transitions are listed in Table 7. 
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Figure 17. (a) ! H, l5 N-HSQC spectrum of [ l5 N]-Ubi4-KFn3. (b). Difference 
(^wiid-type " ^ubi4) °f (b) and l5 N (c) chemical shifts plotted versus residue number. 
Values for residues 82-84 (shown as filled circles) where Ubi4-K deletions are set to 
zero. Open circles indicate residues that are mutated in the Ubi4-K protein. The 
5 locations of p-strands are indicated with arrows. 

DETAILED DESCRIPTION OF THE INVENTION 

For the past decade the immune system has been exploited as a rich source 
of de novo catalysts. Catalytic antibodies have been shown to have 

10 chemoselectivity, enantioselectivity, large rate accelerations, and even an ability to 
reroute chemical reactions. In most cases the antibodies have been elicited to 
transition state analog (TS A) haptens. These TS A haptens are stable, low-molecular 
weight compounds designed to mimic the structures of the energetically unstable 
transition state species that briefly (approximate half-life 10" 13 s) appear along 

15 reaction pathways between reactants and products. Anti-TSA antibodies, like 

natural enzymes, are thought to selectively bind and stabilize transition state, thereby 
easing the passage of reactants to products. Thus, upon binding, the antibody lowers 
the energy of the actual transition state and increases the rate of the reaction. These 
catalysts can be programmed to bind to geometrical and electrostatic features of the 

20 transition state so that the reaction route can be controlled by neutralizing 

unfavorable charges, overcoming entropic barriers, and dictating stereoelectronic 
features of the reaction. By this means even reactions that are otherwise highly 
disfavored have been catalyzed (Janda et aL 1997)* Further, in many instances 
catalysts have been made for reactions for which there are no known natural or man- 

25 made enzymes. 

The success of any combinatorial chemical system in obtaining a particular 
function depends on the size of the library and the ability to access its members. 
Most often the antibodies that are made in an animal against a hapten that mimics 
the transition state of a reaction are first screened for binding to the hapten and then 
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screened again for catalytic activity. An improved method allows for the direct 
selection for catalysis from antibody libraries in phage, thereby linking chemistry 
and replication. 

A library of antibody fragments can be created on the surface of filamentous 
5 phage viruses by adding randomized antibody genes to the gene that encodes the 
phage's coat protein. Each phage then expresses and displays multiple copies of a 
single antibody fragment on its surface. Because each phage possesses both the 
surface-displayed antibody fragment and the DNA that encodes that fragment, and 
antibody fragment that binds to a target can be identified by amplifying the 

10 associated DNA. 

Immunochemists use as antigens materials that have as little chemical 
reactivity as possible. It is almost always the case that one wishes the ultimate 
antibody to interact with native structures. In reactive immunization the concept is 
just the opposite. One immunizes with compounds that are highly reactive so that 

15 upon binding to the antibody molecule during the induction process, a chemical 
reaction ensues. Later this same chemical reaction becomes part of the mechanism 
of the catalytic event. In a certain sense one is immunizing with a chemical reaction 
rather than a substance per se. Reactive immunogens can be considered as 
analogous to the mechanism-based inhibitors that enzymologists use except that 

20 they are used in the inverse way in that, instead of inhibiting a mechanism, they 
induce a mechanism. 

Man-made catalytic antibodies have considerable commercial potential in 
many different applications. Catalytic antibody-based products have been used 
successfully in prototype experiments in therapeutic applications, such as prodrug 

25 activation and cocaine inactivation, and in nontherapeutic applications, such as 
biosensors and organic synthesis. 

Catalytic antibodies are theoretically more attractive than noncatalytic 
antibodies as therapeutic agents because, being catalytic, they may be used in lower 
doses, and also because their effects are unusually irreversible (for example, peptide 
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bond cleavage rather than binding). In therapy, purified catalytic antibodies could 
be directly administered to a patient, or alternatively the patient's own catalytic 
antibody response could be elicited by immunization with an appropriate hapten. 
Catalytic antibodies also could be used as clinical diagnostic tools or as 
5 regioselective or stereoselective catalysts in the synthesis of fine chemicals. 

L Mutation of Fn3 loops and grafting of Ab loops onto Fn3 

An ideal scaffold for CDR grafting is highly soluble and stable. It is small 
enough for structural analysis, yet large enough to accommodate multiple CDRs so 

10 as to achieve tight binding and/or high specificity. 

A novel strategy to generate an artificial Ab system on the framework of an 
existing non-Ab protein was developed. An advantage of this approach over the 
minimization of an Ab scaffold is that one can avoid inheriting the undesired 
properties of Abs, Fibronectin type HI domain (Fn3) was used as the scaffold. 

15 Fibronectin is a large protein which plays essential roles in the formation of 

extracellular matrix and cell-cell interactions; it consists of many repeats of three 
types (I, II and EH) of small domains (Baron et aL, 1991). Fn3 itself is the paradigm 
of a large subfamily (Fn3 family or s-type Ig family) of the immunoglobulin 
superfamily (IgSF). The Fn3 family includes cell adhesion molecules, cell surface 

20 hormone and cytokine receptors, chaperonins, and carbohydrate-binding domains 
(for reviews, see Bork & Doolittle, 1992; Jones, 1993; Bork et aL, 1994; Campbell 
& Spitzfaden, 1994; Harpez & Chothia, 1994). 

Recently, crystallographic studies revealed that the structure of the DNA 
binding domains of the transcription factor NF-kB is also closely related to the Fn3 

25 fold (Ghosh et aL, 1995; Miiller et aL, 1995). These proteins are all involved in 
specific molecular recognition, and in most cases ligand-binding sites are formed by 
surface loops, suggesting that the Fn3 scaffold is an excellent framework for 
building specific binding proteins. The 3D structure of Fn3 has been determined by 
NMR (Main et aL, 1992) and by X-ray crystallography (Leahy et aL, 1992; 
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Dickinson et aL, 1994). The structure is best described as a p-sandwich similar to 
that of Ab VH domain except that Fn3 has seven P-strands instead of nine (Fig. 1). 
There are three loops on each end of Fn3; the positions of the BC, DE and FG loops 
approximately correspond to those of CDR1, 2 and 3 of the VH domain, 
5 respectively (Fig. 1 C, D). 

Fn3 is small (~ 95 residues), monomeric, soluble and stable. It is one of few 
members of IgSF that do not have disulfide bonds; VH has an interstrand disulfide 
bond (Fig. 1 A) and has marginal stability under reducing conditions. Fn3 has been 
expressed in E. coli (Aukhil et al., 1993). In addition, 17 Fn3 domains are present 

10 just in human fibronectin, providing important information on conserved residues 
which are often important for the stability and folding (for sequence alignment, see 
Main et al., 1992 and Dickinson et aL, 1994). From sequence analysis, large 
variations are seen in the BC and FG loops, suggesting that the loops are not crucial 
to stability. NMR studies have revealed that the FG loop is highly flexible; the 

15 flexibility has been implicated for the specific binding of the 10th Fn3 to a 5 Pj 
integrin through the Arg-Gly-Asp (RGD) motif. In the crystal structure of human 
growth hormone-receptor complex (de Vos et al., 1992), the second Fn3 domain of 
the receptor interacts with hormone via the FG and BC loops, suggesting it is 
feasible to build a binding site using the two loops. 

20 The tenth type III module of fibronectin has a fold similar to that of 

immunoglobulin domains, with seven p strands forming two antiparallel p sheets, 
which pack against each other (Main et aL, 1992), The structure of the type II 
module consists of seven p strands, which form a sandwich of two antiparallel p 
sheets, one containing three strands (ABE) and the other four strands (C'CFG) 

25 (Williams et aL, 1988). The triple-stranded P sheet consists of residues GIu-9-Thr- 
14 (A), Ser-17-Asp-23 (B), and Thr-56-Ser-60 (E). The majority of the conserved 
residues contribute to the hydrophobic core, with the invariant hydrophobic residues 
Trp-22 and Try-68 lying toward the N-terminal and C-terminal ends of the core, 
respectively. The P strands are much less flexible and appear to provide a rigid 
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framework upon which functional, flexible loops are built. The topology is similar 
to that of immunoglobulin C domains. 

Gene construction and mutagenesis 
5 A synthetic gene for tenth Fn3 of human fibronectin (Fig, 2) was designed 

which includes convenient restriction sites for ease of mutagenesis and uses specific 
codons for high-level protein expression (Gribskov et al., 1984). 

The gene was assembled as follows: (1) the gene sequence was divided into 
five parts with boundaries at designed restriction sites (Fig.2); (2) for each part, a 

10 pair of oligonucleotides that code opposite strands and have complementary 

overlaps of - 15 bases was synthesized; (3) the two oligonucleotides were annealed 
and single strand regions were filled in using the Klenow fragment of DNA 
polymerase; (4) the double-stranded oligonucleotide was cloned into the pET3a 
vector (Novagen) using restriction enzyme sites at the termini of the fragment and 

15 its sequence was confirmed by an Applied Biosystems DNA sequencer using the 
dideoxy termination protocol provided by the manufacturer; (5) steps 2-4 were 
repeated to obtain the whole gene (plasmid pAS25) (Fig. 7). 

Although the present method takes more time to assemble a gene than the 
one-step polymerase chain reaction (PCR) method (Sandhu et al., 1992), no 

20 mutations occurred in the gene. Mutations would likely have been introduced by the 
low fidelity replication by Taq polymerase and would have required time-consuming 
gene editing. The gene was also cloned into the pET15b (Novagen) vector (pEWl). 
Both vectors expressed the Fn3 gene under the control of bacteriophage 17 promoter 
(Studler et aL 1990); pAS25 expressed the 96-residue Fn3 protein only, while pEWl 

25 expressed Fn3 as a fusion protein with poly-histidine peptide (His # tag). 

Recombinant DNA manipulations were performed according to Molecular Cloning 
(Sambrook et al., 1989), unless otherwise stated. 

Mutations were introduced to the Fn3 gene using either cassette mutagenesis 
or oligonucleotide site-directed mutagenesis techniques (Deng & Nickoloff, 1992), 
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Cassette mutagenesis was performed using the same protocol for gene construction 
described above; double-stranded DNA fragment coding a new sequence was cloned 
into an expression vector (pAS25 and/or pEWl). Many mutations can be made by 
combining a newly synthesized strand (coding mutations) and an oligonucleotide 
5 used for the gene synthesis. The resulting genes were sequenced to confirm that the 
designed mutations and no other mutations were introduced by mutagenesis 
reactions. 

Design and synthesis of Fn3 mutants with antibody CDRs 

10 Two candidate loops (FG and BC) were identified for grafting. Antibodies 

with known crystal structures were examined in order to identify candidates for the 
sources of loops to be grafted onto Fn3. Anti-hen egg lysozyme (HEL) antibody 
DL3 (Bhat et al., 1994) was chosen as the source of a CDR loop. The reasons for 
this choice were: (1) high resolution crystal structures of the free and complexed 

15 states are available (Fig. 4 A; Bhat et al., 1994), (2) thermodynamics data for the 
binding reaction are available (Tello et al., 1993), (3) D1.3 has been used as a 
paradigm for Ab structural analysis and Ab engineering (Verhoeyen et al., 1988; 
McCafferty et al., 1990) (4) site-directed mutagenesis experiments have shown that 
CDR3 of the heavy chain (VH-CDR3) makes a larger contribution to the affinity 

20 than the other CDRs (Hawkins et al., 1993), and (5) a binding assay can be easily 
performed. The objective for this trial was to graft VH-CDR3 of D 1.3 onto the Fn3 
scaffold without significant loss of stability. 

An analysis of the D 1.3 structure (Fig. 4) revealed that only residues 99-102 
("RDYR") make direct contact with hen egg-white lysozyme (HEL) (Fig. 4 B), 

25 although VH-CDR3 is defined as longer (Bhat et al., 1994). It should be noted that 
the C-terminal half of VH-CDR3 (residues 101-104) made significant contact with 
the VL domain (Fig. 4 B). It has also become clear that Dl .3 VH-CDR3 (Fig. 4 C) 
has a shorter turn between the strands F and G than the FG loop of Fn3 (Fig, 4 D). 
Therefore, mutant sequences were designed by using the RDYR (99-102) of D 1.3 as 
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the core and made different boundaries and loop lengths (Table 1). Shorter loops 
may mimic the D1.3 CDR3 conformation better, thereby yielding higher affinity, but 
they may also significantly reduce stability by removing wild-type interactions of 
Fn3. 

5 

Table 1. Amino acid sequences of D 1.3 VH CDR3, VH8 CDR3 and Fn3 FG loop 
and list of planned mutants. 

96 100 105 









10 


VH8 


ARGAV Vi Y YAMDY WOOu 






75 80 85 

• • • 




Fn3 


YAVTGRGDSPASSKPI 




Mutant 


Sequence 




Dl.3-1 


YAERDYRLDY PI 




Dl.3-2 


YAVRD YRLDY PI 


15 


Dl.3-3 


YAVRDYRLDYASSKPI 




Dl.3-4 


YAVRDYRLDY KPI 




Dl.3-5 


YAVRDYR SKPI 




D 1.3-6 


YAVTRDYRL — SSKPI 




Dl.3-7 


YAVTERDYRL-SSKPI 


20 


VH8-1 


YAVAVVSYYAMDY-PI 




VH8-2 


YAVTAVVSYYASSKPI 



Underlines indicate residues in P-strands. Bold 
characters indicate replaced residues. 
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In addition, an anti-HEL single VH domain termed VH8 (Ward et al., 1989) 
was chosen as a template. VH8 was selected by library screening and, in spite of the 
lack of the VL domain, VH8 has an affinity for HEL of 27 nM, probably due to its 
longer VH-CDR3 (Table 1). Therefore, its VH-CDR3 was grafted onto Fn3. 
5 Longer loops may be advantageous on the Fn3 framework because they may provide 
higher affinity and also are close to the loop length of wild-type Fn3. The 3D 
structure of VH8 was not known and thus the VH8 CDR3 sequence was aligned 
with that of D 1.3 VH-CDR3; two loops were designed (Table 1). 

10 Mutant construction and production 

Site-directed mutagenesis experiments were performed to obtain designed 
sequences. Two mutant Fn3s, Dl.3-1 and Dl.3-4 (Table 1) were obtained and both 
were expressed as soluble His # tag fusion proteins. D 1.3-4 was purified and the 
His^tag portion was removed by thrombin cleavage. D 1.3-4 is soluble up to at least 
15 1 mM at pH 7.2. No aggregation of the protein has been observed during sample 
preparation and NMR data acquisition. 

Protein expression and purification 

E. coli BL21 (DE3) (Novagen) were transformed with an expression vector 

20 (pAS25, pEWl and their derivatives) containing a gene for the wild-type or a 

mutant. Cells were grown in M9 minimal medium and M9 medium supplemented 
with Bactotrypton (Difco) containing ampicillin (200 fig/ml). For isotopic labeling, 
I5 N NH 4 CI and/or 13 C glucose replaced unlabeled components. 500 ml medium in a 
2 liter baffle flask were inoculated with 10 ml of overnight culture and agitated at 

25 37°C. Isopropylthio-p-galactoside (IPTG) was added at a final concentration of 1 
mM to initiate protein expression when OD (600 nm) reaches one. The cells were 
harvested by centrifugation 3 hours after the addition of IPTG and kept frozen at - 
70°C until used. 
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Fn3 without His # tag was purified as follows. Cells were suspended in 
5 rnl/(g cell) of Tris (50 mM, pH 7.6) containing ethylenediaminetetraacetic acid 
(EDTA; 1 mM) and phenylmethylsulfonyl fluoride (1 mM). HEL was added to a 
final concentration of 0.5 mg/ml. After incubating the solution for 30 minutes at 
5 37°C, it was sonicated three times for 30 seconds on ice. Cell debris was removed 
by centrifugation. Ammonium sulfate was added to the solution and precipitate 
recovered by centrifugation. The pellet was dissolved in 5-10 ml sodium acetate 
(50 mM, pH 4.6) and insoluble material was removed by centrifugation. The 
solution was applied to a Sephacryl S100HR column (Pharmacia) equilibrated in the 

10 sodium acetate buffer. Fractions containing Fn3 then was applied to a Resources 
column (Pharmacia) equilibrated in sodium acetate (50 mM, pH 4.6) and eluted with 
a linear gradient of sodium chloride (0-0.5 M). The protocol can be adjusted to 
purify mutant proteins with different surface charge properties. 

Fn3 with His* tag was purified as follows. The soluble fraction was prepared 

15 as described above, except that sodium phosphate buffer (50 mM, pH 7.6) 

containing sodium chloride (100 mM) replaced the Tris buffer. The solution was 
applied to a Hi-Trap chelating column (Pharmacia) preloaded with nickel and 
equilibrated in the phosphate buffer. After washing the column with the buffer, 
His«tag-Fn3 was eluted in the phosphate buffer containing 50 mM EDTA. Fractions 

20 containing His*tag-Fn3 were pooled and applied to a Sephacryl S 100-HR column, 
yielding highly pure protein. The His*tag portion was cleaved off by treating the 
fusion protein with thrombin using the protocol supplied by Novagen. Fn3 was 
separated from the His«tag peptide and thrombin by a Resources column using the 
protocol above. 

25 The wild-type and two mutant proteins so far examined are expressed as 

soluble proteins. In the case that a mutant is expressed as inclusion bodies 
(insoluble aggregate), it is first examined if it can be expressed as a soluble protein 
at lower temperature (e.g., 25-30°C). If this is not possible, the inclusion bodies are 
collected by low-speed centrifugation following cell lysis as described above. The 
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pellet is washed with buffer, sonicated and centrifuged. The inclusion bodies are 
solubilized in phosphate buffer (50 mM, pH 7.6) containing guanidinium chloride 
(GdnCl, 6 M) and will be loaded on a Hi-Trap chelating column. The protein is 
eluted with the buffer containing GdnCI and 50 mM EDTA, 

5 

Conformation of mutant Fn3» DL3-4 

The l H NMR spectra of His»tag D 1.3-4 fusion protein closely resembled that 
of the wild-type, suggesting the mutant is folded in a similar conformation to that of 
the wild-type. The spectrum of D 1.3-4 after the removal of the HisHag peptide 

10 showed a large spectral dispersion. A large dispersion of amide protons (7-9.5 ppm) 
and a large number of downfield (5.0-6.5 ppm) C tf protons are characteristic of a p- 
sheet protein (Wuthrich, 1986). 

The 2D NOESY spectrum of DL3-4 provided further evidence for a 
preserved conformation. The region in the spectrum showed interactions between 

15 upfield methyl protons (< 0.5 ppm) and methyl-methylene protons. The Val72 y 
methyl resonances were well separated in the wild-type spectrum (-0.07 and 
0.37 ppm; (Baron et al., 1992)). Resonances corresponding to the two methyl 
protons are present in the D 1.3-4 spectrum (-0.07 and 0.44 ppm). The cross peak 
between these two resonances and other conserved cross peaks indicate that the two 

20 resonances in the D 1 .3-4 spectrum are highly likely those of Val72 and that other 
methyl protons are in nearly identical environment to that of wild-type Fn3. Minor 
differences between the two spectra are presumably due to small structural 
perturbation due to the mutations. Val72 is on the F strand, where it forms a part of 
the central hydrophobic core of Fn3 (Main et al., 1992). It is only four residues 

25 away from the mutated residues of the FG loop (Table 1). The results are 

remarkable because, despite there being 7 mutations and 3 deletions in the loop 
(more than 10% of total residues; Fig. 12, Table 2), Dl.3-4 retains a 3D structure 
virtually identical to that of the wild-type (except for the mutated loop). Therefore, 
the results provide strong support that the FG loop is not significantly contributing 



24 



to the folding and stability of the Fn3 molecule and thus that the FG loop can be 
mutated extensively. 



10 



15 
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Table 2. Sequences of oligonucleotides 

Name Sequence 

FN1F CGGGATCC CATATGC AGGTTTCTGATGTTCCGCGTGACCTGGAAGTTGTTGCTGCGACC 
FN I R TAA CTGCAGG AGCATCCCAGCTGATCAGCAGGCTAGTCGGGGTCGCAGCAACAAC 
FN2F CTCCTCCAGITACCGTGCGT^^ 
FN2R GTGAATTCCIGAACCGGGGAGTTAC^ 

FN3F A GGAATTC ACTGTACCTGGTTCCAAGTCTACTGCTACCATCAGCGG 
FN3R GTATA GTCGACA CCCGGTTTCAGGCCGCTGATGGTAGC 
FN4F CGGG TGTCGACT ATACCATCACTGTATACGCT 

FN4R CGGGATC CGAGCTCG CTGGGCTGTCACCACGGCCAGTAACAGCGTATACAGTGAT 

FN5F CAG CGAGCTC CAAGCCAATCTCGATTAACTACCGT 

FN5R C GGGATCCT CGAGTTACTAGGTACGGTAGTTAATCGA 

FN5R' CGGGATCCACGCGTGCCACCGGTACGGTAGTTAATCGA 

gene3F C GGGATCC ACGCGTCCATTCGTTTGTGAATATCAAGGCCAATCG 

gene3R CCGGAAGCTTTAAGACTCCTTATTACGCAGTATGTTAGC 

38TAABglII CIXjTTACTGGCCGTGAGATCTAACCAGCGAGCTCCA 

BC3 GATCAGCTCKXJATGCTCCTNNKNNK^ 

FG2 TGTATACGGTGTTACTGGCNNKNNKNNKNNKNNKNNKNNKTCCAAGCCAATCTCGAT 
FG3 CTGTATACGCTGTTACTGGCNNKNNKNNKNNKCCAGCGAGCTCCAAG 
FG4 C ATCACTGTATACGCTGTTACTNN KNNKNNKN N KNNKTCC A AGCC AATCTC 
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Restriction enzyme sites are underlined. N and K denote an equimolar mixture of 
A, T. G and C and that of G and T, respectively. 



30 Structure and stability measurements 

Structures of Abs were analyzed using quantitative methods (e.g., DSSP 
(Kabsch & Sander, 1983) and PDBfit (D. McRee, The Scripps Research Institute)) 
as well as computer graphics (e.g., Quanta (Molecular Simulations) and What if (G. 
Vriend, European Molecular Biology Laboratory)) to superimpose the strand-loop- 
35 strand structures of Abs and Fn3. 
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The stability of Fn Abs was determined by measuring temperature- and 
chemical denaturant-induced unfolding reactions (Pace et al, 1989). The 
temperature-induced unfolding reaction was measured using a circular dichroism 
(CD) polarimeter. Ellipticity at 222 and 215 nm was recorded as the sample 
5 temperature was slowly raised. Sample concentrations between 10 and 50 were 
used. After the unfolding baseline was established, the temperature was lowered to 
examine the reversibility of the unfolding reaction. Free energy of unfolding was 
determined by fitting data to the equation for the two-state transition (Becktel & 
Schellman, 1987; Pace et al., 1989). Nonlinear least-squares fitting was performed 

10 using the program Igor (WaveMetrics) on a Macintosh computer. 

The structure and stability of two selected mutant Fn3s were studied; the first 
mutant was D 1.3-4 (Table 2) and the second was a mutant called AS40 which 
contains four mutations in the BC loop (A 26 V 27 T 28 V 29 ) - TQRQ). AS40 was 
randomly chosen from the BC loop library described above. Both mutants were 

15 expressed as soluble proteins in E, coli and were concentrated at least to 1 mM, 
permitting NMR studies. 

The mid-point of the thermal denaturation for both mutants was 
approximately 69°C, as compared to approximately 79°C for the wild-type protein. 
The results indicated that the extensive mutations at the two surface loops did not 

20 drastically decrease the stability of Fn3, and thus demonstrated the feasibility of 
introducing a large number of mutations in both loops. 

Stability was also determined by guanidinium chloride (GdnCl)- and urea- 
induced unfolding reactions. Preliminary unfolding curves were recorded using a 
fluorometer equipped with a motor-driven syringe; GdnCl or urea were added 

25 continuously to the protein solution in the cuvette. Based on the preliminary 

unfolding curves, separate samples containing varying concentration of a denaturant 
were prepared and fluorescence (excitation at 290 nm, emission at 300-400 nm) or 
CD (ellipticity at 222 and 215 nm) were measured after the samples were 
equilibrated at the measurement temperature for at least one hour. The curve was 
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fitted by the least-squares method to the equation for the two-state model (Santoro & 
Bolen, 1988; Koide et al., 1993). The change in protein concentration was 
compensated if required. 

Once the reversibility of the thermal unfolding reaction is established, the 
5 unfolding reaction is measured by a Microcal MC-2 differential scanning 
calorimeter (DSC). The cell (~ 1.3 ml) will be filled with FnAb solution (0.1 - 
1 mM) and ACp (= AH/ AT) will be recorded as the temperature is slowly raised. T m 
(the midpoint of unfolding), AH of unfolding and AG of unfolding is determined by 
fitting the transition curve (Privalov & Potekhin, 1986) with the Origin software 
1 0 provided by Microcal. 

Thermal unfolding 

A temperature-induced unfolding experiment on Fn3 was performed using 
circular dichroism (CD) spectroscopy to monitor changes in secondary structure. 

15 The CD spectrum of the native Fn3 shows a weak signal near 222 nm (Fig. 3 A), 
consistent with the predominantly p-structure of Fn3 (Perczel et al, 1992). A 
cooperative unfolding transition is observed at 80-90°C, clearly indicating high 
stability of Fn3 (Fig. 3B). The free energy of unfolding could not be determined due 
to the lack of a post-transition baseline. The result is consistent with the high 

20 stability of the first Fn3 domain of human fibronectin (Litvinovich et al., 1992), thus 
indicating that Fn3 domains are in general highly stable. 

Binding assays 

Binding reaction of FnAbs were characterized quantitatively using an 
25 isothermal titration calorimeter (ITC) and fluorescence spectroscopy. 

The enthalpy change (AH) of binding were measured using a Microcal 
Omega ITC (Wiseman et al., 1989). The sample cell (~ 1.3 ml) was filled with 
FnAbs solution (< 100 pM, changed according to IQ), and the reference cell filled 
with distilled water; the system was equilibrated at a given temperature until a stable 
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baseline is obtained; 5-20 pi of ligand solution (< 2 mM) was injected by a motor- 
driven syringe within a short duration (20 sec) followed by an equilibration delay 
(4 minutes); the injection was repeated and heat generation/absorption for each 
injection was measured. From the change in the observed heat change as a function 
5 of ligand concentration, AH and K d was determined (Wiseman et al., 1989). AG and 
AS of the binding reaction was deduced from the two directly measured parameters. 
Deviation from the theoretical curve was examined to assess nonspecific (multiple- 
site) binding. Experiments were also be performed by placing a ligand in the cell 
and titrating with an FnAb. It should be emphasized that only FTC gives direct 

10 measurement of AH, thereby making it possible to evaluate enthaipic and entropic 
contributions to the binding energy. FTC was successfully used to monitor the 
binding reaction of the D1.3 Ab (Tello et al., 1993; Bhat et al., 1994). 

Intrinsic fluorescence is monitored to measure binding reactions with in 
the sub-pM range where the determination of by ETC is difficult. Trp 

15 fluorescence (excitation at - 290 nm, emission at 300-350 nm) and Tyr fluorescence 
(excitation at ~ 260 nm, emission at - 303 nm) is monitored as the Fn3~mutant 
solution (< 10 (iM) is titrated with ligand solution (< 100 pM). K 4 of the reaction is 
determined by the nonlinear least-squares fitting of the bimolecular binding 
equation. Presence of secondary binding sites is examined using Scatchard analysis. 

20 In all binding assays, control experiments are performed busing wild-type Fn3 (or 
unrelated FnAbs) in place of Fn Abs of interest. 

II. Production of Fn3 mutants with high affinity and specificity FnAbs 
Library screening was carried out in order to select FnAbs which bind to 
25 specific ligands. This is complementary to the modeling approach described above. 
The advantage of combinatorial screening is that one can easily produce and screen 
a large number of variants 10 8 ), which is not feasible with specific mutagenesis 
("rational design*') approaches. The phage display technique (Smith, 1985; O'Neil 
& Hoess, 1995) was used to effect the screening processes. Fn3 was fused to a 
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phage coat protein (pill) and displayed on the surface of filamentous phages. These 
phages harbor a single-stranded DNA genome that contains the gene coding the Fn3 
fusion protein. The amino acid sequence of defined regions of Fn3 were 
randomized using a degenerate nucleotide sequence, thereby constructing a library. 
5 Phages displaying Fn3 mutants with desired binding capabilities were selected in 
vitro^ recovered and amplified. The amino acid sequence of a selected clone can be 
identified readily by sequencing the Fn3 gene of the selected phage. The protocols 
of Smith (Smith & Scott, 1993) were followed with minor modifications. 

The objective was to produce FnAbs which have high affinity to small 

10 protein ligands. HEL and the Bl domain of staphylococcal protein G (hereafter 
referred to as protein G) were used as ligands. Protein G is small (56 amino acids) 
and highly stable (Minor & Kim, 1994; Smith et aL, 1994). Its structure was 
determined by NMR spectroscopy (Gronenborn et aL, 1991) to be a helix packed 
against a four-strand P-sheet. The resulting FnAb-protein G complexes (~ 150 

15 residues) is one of the smallest protein-protein complexes produced to date, well 
within the range of direct NMR methods. The small size, the high stability and 
solubility of both components and the ability to label each with stable isotopes ( 13 C 
and 15 N; see below for protein G) make the complexes an ideal model system for 
NMR studies on protein-protein interactions. 

20 The successful loop replacement of Fn3 (the mutant D 1 .3-4) demonstrate 

that at least ten residues can be mutated without the loss of the global fold. Based 
on this, a library was first constructed in which only residues in the FG loop are 
randomized. After results of loop replacement experiments on the BC loop were 
obtained, mutation sites were extended that include the BC loop and other sites. 

25 

Construction of Fn3 phage display system 

An M13 phage-based expression vector pASMl has been constructed as 
follows: an oligonucleotide coding the signal peptide of OmpT was cloned at the 5' 
end of the Fn3 gene; a gene fragment coding the C-terminal domain of M13 pin was 
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prepared from the wild-type gene HI gene of M13 mpl8 using PCR (Corey et aL, 
1993) and the fragment was inserted at the 3' end of the OmpT-Fn3 gene; a spacer 
sequence has been inserted between Fn3 and pin. The resultant fragment (OmpT- 
Fn3-pIH) was cloned in the multiple cloning site of M13 mpl8, where the fusion 
5 gene is under the control of the lac promoter. This system will produce the Fn3-pin 
fusion protein as well as the wild-type pin protein. The co-expression of wild-type 
pin is expected to reduce the number of fusion pIH protein, thereby increasing the 
phage infectivity (Corey et aL, 1993) (five copies of pin are present on a phage 
particle). In addition, a smaller number of fusion pIH protein may be advantageous 

10 in selecting tight binding proteins, because the chelating effect due to multiple 
binding sites should be smaller than that with ail five copies of fusion pm (Bass et 
aL, 1990), This system has successfully displayed the serine protease trypsin (Corey 
et aL, 1993). Phages were produced and purified using E. coli K91kan (Smith & 
Scott, 1993) according to a standard method (Sambrook et aL, 1989) except that 

15 phage particles were purified by a second polyethylene glycol precipitation and acid 
precipitation. 

Successful display of Fn3 on fusion phages has been confirmed by ELIS A 
using an Ab against fibronectin (Sigma), clearly indicating that it is feasible to 
construct libraries using this system. 

20 An alternative system using the fUSE5 (Parmley & Smith, 1988) may also be 

used. The Fn3 gene is inserted to fUSE5 using the Sfil restriction sites introduced at 
the 5'- and 3'- ends of the Fn3 gene PCR. This system displays only the fusion pIH 
protein (up to five copies) on the surface of a phage. Phages are produced and 
purified as described (Smith & Scott, 1993). This system has been used to display 

25 many proteins and is robust. The advantage of fUSE5 is its low toxicity. This is 
due to the low copy number of the replication form (RF) in the host, which in turn 
makes it difficult to prepare a sufficient amount of RF for library construction 
(Smith & Scott, 1993). 
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Construction of libraries 

The first library was constructed of the Fn3 domain displayed on the surface 
of MB phage in which seven residues (77-83) in the FG loop (Fig. 4D) were 
randomized. Randomization will be achieved by the use of an oligonucleotide 
5 containing degenerated nucleotide sequence. A double-stranded nucleotide was 
prepared by the same protocol as for gene synthesis (see above) except that one 
strand had an (NNK) 6 (NNG) sequence at the mutation sites, where N corresponds to 
an equimolar mixture of A, T, G and C and K corresponds to an equimolar mixture 
of G and T. The (NNG) codon at residue 83 was required to conserve the SacI 

10 restriction site (Fig. 2). The (NNK) codon codes all of the 20 amino acids, while the 
NNG codon codes 14. Therefore, this library contained ~ 10 9 independent 
sequences. The library was constructed by ligating the double-stranded nucleotide 
into the wild-type phage vector, pASMl, and the transfecting E. coli XL1 blue 
(Stratagene) using electroporation. XL1 blue has the lacF phenotype and thus 

15 suppresses the expression of the Fn3-pIE fusion protein in the absence of lac 

inducers. The initial library was propagated in this way, to avoid selection against 
toxic Fn3-pIII clones. Phages displaying the randomized Fn3-pIII fusion protein 
were prepared by propagating phages with K91kan as the host. K91kan does not 
suppress the production of the fusion protein, because it does not have lacF. 

20 Another library was also generated in which the BC loop (residues 26-20) was 
randomized. 

Selection of displayed FnAbs 

Screening of Fn3 phage libraries was performed using the biopanning 
25 protocol (Smith & Scott, 1993); a ligand is biotinylated and the strong biotin- 

streptavidin interaction was used to immobilize the ligand on a streptavidin-coated 
dish. Experiments were performed at room temperature (~ 22°C). For the initial 
recovery of phages from a library, 10 |ig of a biotinylated ligand were immobilized 
on a streptavidin-coated polystyrene dish (35 mm, Falcon 1008) and then a phage 
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solution (containing ~ 10 11 pfu (plaque-forming unit)) was added. After washing the 
dish with an appropriate buffer (typically TBST, Tris-HCl (50 mM, pH 7.5), NaCl 
(150 mM) and Tween 20 (0.5%)), bound phages were eluted by one or combinations 
of the following conditions: low pH, an addition of a free ligand, urea (up to 6 M) 
5 and, in the case of anti-protein G FnAbs, cleaving the protein G-biotin linker by 
thrombin. Recovered phages were amplified using the standard protocol using 
K9 Ikan as the host (Sambrook et aL, 1989). The selection process were repeated 3- 
5 times to concentrate positive clones. From the second round on, the amount of the 
ligand were gradually decreased (to ~ 1 |ig) and the biotinylated ligand were mixed 

10 with a phage solution before transferring a dish (G. P. Smith, personal 

communication). After the final round, 10-20 clones were picked, and their DNA 
sequence will be determined. The ligand affinity of the clones were measured first 
by the phage-ELIS A method (see below). 

To suppress potential binding of the Fn3 framework (background binding) to 

15 a ligand, wild-type Fn3 may be added as a competitor in the buffers. In addition, 
unrelated proteins (e.g., bovine serum albumin, cytochrome c and RNase A) may be 
used as competitors to select highly specific FnAbs. ' 

Binding assay 

20 The binding affinity of FnAbs on phage surface is characterized semi- 

quantitatively using the phage ELBA technique (Li et aL, 1995). Wells of 
microtiter plates (Nunc) are coated with a ligand protein (or with streptavidin 
followed by the binding of a biotinylated ligand) and blocked with the Blotto 
solution (Pierce). Purified phages (~ 10 10 pfu) originating from single plaques 

25 (M13)/colonies (fUSE5) are added to each well and incubated overnight at 4°C. 
After washing wells with an appropriate buffer (see above), bound phages are 
detected by the standard ELISA protocol using anti-M13 Ab (rabbit, Sigma) and 
an ti -rabbit Ig-peroxidase conjugate (Pierce) or using anti-M13 Ab-peroxidase 
conjugate (Pharmacia). Colormetric assays are performed using TMB (3,3',S,5'- 
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tetramethylbenzidine, Pierce), The high affinity of protein G to immunoglobulins 
present a special problem; Abs cannot be used in detection. Therefore, to detect 
anti-protein G FnAbs, fusion phages are immobilized in wells and the binding is 
then measured using biotinylated protein G followed by the detection using 
5 streptavidin-peroxidase conjugate. 

Production of soluble FnAbs 

After preliminary characterization of mutant Fn3s using phage ELIS A, 
mutant genes are subcloned into the expression vector pEWL Mutant proteins are 
10 produced as His # tag fusion proteins and purified, and their conformation, stability 
and ligand affinity are characterized. 

Thus, Fn3 is the fourth example of a monomeric immunoglobulin-like 
scaffold that can be used for engineering binding proteins. Successful selection of 

15 novel binding proteins have also been based on minibody, tendamistat and 

"camelized" immunoglobulin VH domain scaffolds (Martin et al., 1994; Davies & 
Riechmann, 1995; McConneil & Hoess, 1995). The Fn3 scaffold has advantages 
over these systems. Bianchi et aL reported that the stability of a minibody was 2.5 
kcal/mol, significantly lower than that of Ubi4-K. No detailed structural 

20 characterization of minibodies has been reported to date. Tendamistat and the VH 
domain contain disulfide bonds, and thus preparation of correctly folded proteins 
may be difficult. Davies and Riechmann reported that the yields of their camelized 
VH domains were less than 1 mg per liter culture (Davies & Riechmann, 1996). 
Thus, the Fn3 framework can be used as a scaffold for molecular 

25 recognition. Its small size, stability and well-characterized structure make Fn3 an 
attractive system. In light of the ubiquitous presence of Fn3 in a wide variety of 
natural proteins involved in ligand binding, one can engineer Fn3-based binding 
proteins to different classes of targets. 

The following examples are intended to illustrate but not limit the invention. 
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EXAMPLE I 
Construction of the Fn3 gene 

A synthetic gene for tenth Fn3 of fibronectin (Fig. 1) was designed on the 
basis of amino acid residue 1416-1509 of human fibronectin (Kornblihtt, et al, 
5 1985) and its three dimensional structure (Main, et ai, 1992). The gene was 

engineered to include convenient restriction sites for mutagenesis and the so-called 
"preferred codons" for high level protein expression (Gribskov, et aL, 1984) were 
used. In addition, a glutamine residue was inserted after the N-terminal methionine 
in order to avoid partial processing of the N-terminal methionine which often 

10 degrades NMR spectra (Smith, et al, 1994). Chemical reagents were of the 

analytical grade or better and purchased from Sigma Chemical Company and JX 
Baker, unless otherwise noted. Recombinant DNA procedures were performed as 
described in "Molecular Cloning" (Sambrook, et aL, 1989), unless otherwise stated. 
Custom oligonucleotides were purchased from Operon Technologies. Restriction 

15 and modification enzymes were from New England Biolabs. 

The gene was assembled in the following manner. First, the gene sequence 
(Fig. 5) was divided into five parts with boundaries at designed restriction sites: 
fragment 1, Ndel-PstI (oligonucleotides FN1F and FN1R (Table 2); fragment 2, 
Pstl-EcoRI (FN2F and FN2R); fragment 3, EcoRI-Sall (FN3F and FN3R); fragment 

20 4, Sail-Sad (FN4F and FN4R); fragment 5, SacI-BamHI (FN5F and FN5R). 

Second, for each part, a pair of oligonucleotides which code opposite strands and 
have complementary overlaps of approximately 15 bases was synthesized. These 
oligonucleotides were designated FN1F-FN5R and are shown in Table 2. Third, 
each pair (e.g., FN1F and FN1R) was annealed and single-strand regions were filled 

25 in using the Klenow fragment of DNA polymerase. Fourth, the double stranded 
oligonucleotide was digested with the relevant restriction enzymes at the termini of 
the fragment and cloned into the pBlueScript SK plasmid (Stratagene) which had 
been digested with the same enzymes as those used for the fragments. The DNA 
sequence of the inserted fragment was confirmed by DNA sequencing using an 
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Applied Biosystems DNA sequencer and the dideoxy termination protocol provided 
by the manufacturer. Last, steps 2-4 were repeated to obtain the entire gene. 

The gene was also cloned into the pET3a and pET15b (Novagen) vectors 
(pAS45 and pAS25, respectively). The maps of the plasmids are shown in Figs. 6 
5 and 7. E. coli BL21 (DE3) (Novagen) containing these vectors expressed the Fn3 
gene under the control of bacteriophage T7 promotor (Studier, et aL, 1990); pAS24 
expresses the 96-residue Fn3 protein only, while pAS45 expresses Fn3 as a fusion 
protein with poly-histidine peptide (His # tag). High level expression of the Fn3 
protein and its derivatives in £. coli was detected as an intense band on SDS-PAGE 
10 stained with CBB. 

The binding reaction of the monobodies is characterized quantitatively by 
means of fluorescence spectroscopy using purified soluble monobodies. 

Intrinsic fluorescence is monitored to measure binding reactions. Trp 
fluorescence (excitation at -290 nm, emission at 300 350 nm) and Tyr fluorescence 
15 (excitation at -260 nm, emission at -303 nm) is monitored as the Fn3-mutant 

solution (< 100 \xM) is titrated with a ligand solution. When a ligand is fluorescent 
(e.g. fluorescein), fluorescence from the ligand may be used. K d of the reaction will 
be determined by the nonlinear least-squares fitting of the bimolecular binding 
equation. 

20 If intrinsic fluorescence cannot be used to monitor the binding reaction, 

monobodies are labeled with fluorescein-NHS (Pierce) and fluorescence 
polarization is used to monitor the binding reaction (Burke et aL, 1996). 

EXAMPLE II 

25 Modifications to include restriction sites in the Fn3 gene 

The restriction sites were incorporated in the synthetic Fn3 gene without 
changing the amino acid sequence Fn3. The positions of the restriction sites were 
chosen so that the gene construction could be completed without synthesizing long 
(>60 bases) oligonucleotides and so that two loop regions could be mutated 
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(including by randomization) by the cassette mutagenesis method (i.e M swapping a 
fragment with another synthetic fragment containing mutations). In addition, the 
restriction sites were chosen so that most sites were unique in the vector for phage 
display. Unique restriction sites allow one to recombine monobody clones which 
5 have been already selected in order to supply a larger sequence space. 

EXAMPLE III 
Construction of M13 phage display libraries 

A vector for phage display, pAS38 (for its map, see Fig. 8) was constructed 
10 as follows. The Xbal-BamHI fragment of pET12a encoding the signal peptide of 
OmpT was cloned at the 5* end of the Fn3 gene* The C-terminal region (from the 
FN5F and FN5R oligonucleotides, see Table 2) of the Fn3 gene was replaced with a 
new fragment consisting of the FN5F and FN5R' oligonucleotides (Table 2) which 
introduced a Mlul site and a linker sequence for making a fusion protein with the 
1 5 pin protein of bacteriophage M 1 3. A gene fragment coding the C-terminal domain 
of M13 pin was prepared from the wild-type gene IH of M13mpl8 using PCR 
(Corey, et ai 9 1993) and the fragment was inserted at the 3 1 end of the OmpT-Fn3 
fusion gene using the Mlul and HindlH sites. 

Phages were produced and purified using a helper phage, M13K07, 
20 according to a standard method (Sambrook, et aL f 1989) except that phage particles 
were purified by a second polyethylene glycol precipitation. Successful display of 
Fn3 on fusion phages was confirmed by ELISA (Harlow & Lane, 1988) using an 
antibody against fibronectin (Sigma) and a custom anti-FN3 antibody (Cocalico 
Biologicals, PA, USA). 
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EXAMPLE IV 
Libraries containing loop variegations in the AB loop 

A nucleic acid phage display library having variegation in the AB loop is 
prepared by the following methods. Randomization is achieved by the use of 

5 oligonucleotides containing degenerated nucleotide sequence. Residues to be 
variegated are identified by examining the X-ray and NMR structures of Fn3 
(Protein Data Bank accession numbers, 1FNA and 1TTF, respectively). 
Oligonucleotides containing NNK (N and K here denote an equimolar mixture of A, 
T, G, and C and an equimolar mixture of G and T, respectively) for the variegated 

10 residues are synthesized (see oligonucleotides BC3, FG2, FG3, and FG4 in Table 2 
for example). The NNK mixture codes for all twenty amino acids and one 
termination codon (TAG). TAG, however, is suppressed in the E. coli XL-1 blue. 
Single-stranded DNAs of pAS38 (and its derivatives) are prepared using a standard 
protocol (Sambrook, etal, 1989). 

1 5 Site-directed mutagenesis is performed following published methods (see for 

example, Kunkel, 1985) using a Muta-Gene kit (BioRad). The libraries are 
constructed by electroporation of E. coli XL-1 Blue electroporation competent cells 
(200 pi; Stratagene) with lpg of the plasmid DNA using a BTX electrocell 
manipulator ECM 395 1mm gap cuvette. A portion of the transformed cells is 

20 plated on an LB-agar plate containing ampicillin ( 1 00 ug/ml) to determine the 

transformation efficiency. Typically, 3 X 10 8 transformants are obtained with 1 jag 
of DNA, and thus a library contains 10 8 to 10 9 independent clones. Phagemid 
particles were prepared as described above. 



EXAMPLE V 
Loop variegations in the BC, CD, DE, EF or FG loop 

A nucleic acid phage display library having five variegated residues (residues 
number 26-30) in the BC loop, and one having seven variegated residues (residue 
numbers 78-84) in the FG loop, was prepared using the methods described in 
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Example IV above. Other nucleic acid phage display libraries having variegation in 
the CD, DE or EF loop can be prepared by similar methods. 

EXAMPLE VI 

5 Loop variegations in the FG and BC loop 

A nucleic acid phage display library having seven variegated residues 
(residues number 78-84) in the FG loop and five variegated residues (residue 
number 26-30) in the BC loop was prepared. Variegations in the BC loop were 
prepared by site-directed mutagenesis (Kunkel, et al) using the BC3 oligonucleotide 

10 described in Table 1. Variegations in the FG loop were introduced using site- 
directed mutagenesis using the BC loop library as the starting material, thereby 
resulting in libraries containing variegations in both BC and FG loops. The 
oligonucleotide FG2 has variegating residues 78-84 and oligonucleotide FG4 has 
variegating residues 77-81 and a deletion of residues 82-84. 

15 A nucleic acid phage display library having five variegated residues (residues 

78-84) in the FG loop and a three residue deletion (residues 82-84) in the FG loop, 
and five variegated residues (residues 26-30) in the BC loop, was prepared. The 
shorter FG loop was made in an attempt to reduce the flexibility of the FG loop; the 
loop was shown to be highly flexible in Fn3 by the NMR studies of Main, et al 

20 (1992). A highly flexible loop may be disadvantageous to forming a binding site 
with a high affinity (a large entropy loss is expected upon the iigand binding, 
because the flexible loop should become more rigid). In addition, other Fn3 
domains (besides human) have shorter FG loops (for sequence alignment, see Figure 
12 in Dickinson, etal (1994)). 

25 Randomization was achieved by the use of oligonucleotides containing 

degenerate nucleotide sequence (oligonucleotide BC3 for variegating the BC loop 
and oligonucleotides FG2 and FG4 for variegating the FG loops). 

Site-directed mutagenesis was performed following published methods (see 
for example, Kunkel, 1985). The libraries were constructed by electrotransforming 



38 



E. coli XL-1 Blue (Stratagene). Typically a library contains 10 8 to 10 9 independent 
clones. Library 2 contains five variegated residues in the BC loop and seven 
variegated residues in the FG loop. Library 4 contains five variegated residues in 
each of the BC and FG loops, and the length of the FG loop was shortened by three 
5 residues. 

EXAMPLE VII 
fd phage display libraries constructed with loop variegations 

Phage display libraries are constructed using the fd phage as the genetic 
10 vector. The Fn3 gene is inserted in fUSE5 (Parmley & Smith, 1988) using Sfil 
restriction sites which are introduced at the 5' and 3 ' ends of the Fn3 gene using 
PCR. The expression of this phage results in the display of the fusion pin protein 
on the surface of the fd phage. Variegations in the Fn3 loops are introduced using 
site-directed mutagenesis as described hereinabove, or by subcloning the Fn3 
1 5 libraries constructed in M 1 3 phage into the fUSE5 vector. 



EXAMPLE VIII 
Other phage display libraries 
T7 phage libraries (Novagen, Madison, WI) and bacterial pili expression 
20 systems (Invitrogen) are also useful to express the Fn3 gene. 



EXAMPLE IX 

Isolation of polypeptides which bind to macromolecular structures 

The selection of phage-displayed monobodies was performed following the 
25 protocols of Barbas and coworkers (Rosenblum & Barbas, 1995). Briefly, 

approximately 1 ug of a target molecule ("antigen") in sodium carbonate buffer (100 
mM, pH 8.5) was immobilized in the wells of a microtiter plate (Maxisorp, Nunc) 
by incubating overnight at 4°C in an air tight container. After the removal of this 
solution, the wells were then blocked with a 3% solution of BS A (Sigma, Fraction 
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V) in TBS by incubating the plate at 37 °C for 1 hour. A phagemid library solution 
(50 ul) containing approximately 10 12 colony forming units (cfu) of phagemid was 
absorbed in each well at 37 °C for I hour. The wells were then washed with an 
appropriate buffer (typically TBST, 50 mM Tris-HCl (pH 7.5), 150 mM NaCl, and 

5 0.5% Tween20) three times (once for the first round). Bound phage were eluted by 
an acidic solution (typically, 0.1 M glycine-HCl, pH 2.2; 50 ul) and recovered phage 
were immediately neutralized with 3 ul of Tris solution. Alternatively, bound phage 
were eluted by incubating the wells with 50 ul of TBS containing the antigen (1 - 
10 uM). Recovered phage were amplified using the standard protocol employing 

10 the XLlBlue cells as the host (Sambrook, et al). The selection process was 

repeated 5-6 times to concentrate positive clones. After the final round, individual 
clones were picked and their binding affinities and DNA sequences were 
determined. 

The binding affinities of monobodies on the phage surface were 
15 characterized using the phage ELISA technique (Li, et al, 1995). Wells of 
microliter plates (Nunc) were coated with an antigen and blocked with BSA. 
Purified phages (10 8 - 10" cfu) originating from a single colony were added to each 
well and incubated 2 hours at 37 °C. After washing wells with an appropriate buffer 
(see above), bound phage were detected by the standard ELISA protocol using anti- 
20 M 1 3 antibody (rabbit, Sigma) and anti-rabbit Ig-peroxidase conjugate (Pierce). 
Colorimetric assays were performed using Turbo-TMB (3,3',5,5'- 
tetramethylbenzidine, Pierce) as a substrate. 

The binding affinities of monobodies on the phage surface were further 
characterized using the competition ELISA method (Djavadi-Ohaniance, et al., 
25 1996). In this experiment, phage ELISA is performed in the same manner as 
described above, except that the phage solution contains a ligand at varied 
concentrations. The phage solution was incubated a 4°C for one hour prior to the 
binding of an immobilized ligand in a microtiter plate well. The affinities of phage 
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displayed monobodies are estimated by the decrease in ELISA signal as the free 
ligand concentration is increased. 

After preliminary characterization of monobodies displayed on the surface of 
phage using phage ELISA, genes for positive clones were subcloned into the 

5 expression vector pAS45. E. coli BL21(DE3) (Novagen) was transformed with an 
expression vector (pAS45 and its derivatives). Cells were grown in M9 minimal 
medium and M9 medium supplemented with Bactotryptone (Difco) containing 
ampicillin (200 ng/ml). For isotopic labeling, t5 N NH 4 C1 and/or l3 C glucose 
replaced unlabeled components. Stable isotopes were purchased from Isotec and 

10 Cambridge Isotope Labs. 500 ml medium in a 2 1 baffle flask was inoculated with 
10 ml of overnight culture and agitated at approximately 140 rpm at 37°C IPTG 
was added at a final concentration of 1 mM to induce protein expression when 
OD(600 nm) reached approximately 1.0. The cells were harvested by centrifugation 
3 hours after the addition of IPTG and kept frozen at -70°C until used. 

1 5 Fn3 and monobodies with His*tag were purified as follows. Cells were 

suspended in 5 ml/(g cell) of 50 mM Tris (pH 7.6) containing 1 mM 
phenylmethylsulfonyl fluoride. HEL (Sigma, 3X crystallized) was added to a final 
concentration of 0.5 mg/ml After incubating the solution for 30 min at 37 °C, it was 
sonicated so as to cause cell breakage three times for 30 seconds on ice. Cell debris 

20 was removed by centrifugation at 15,000 rpm in an Sorval RC-2B centrifuge using 
an SS-34 rotor. Concentrated sodium chloride is added to the solution to a final 
concentration of 0.5 M. The solution was then applied to a 1 ml HisTrap™ 
chelating column (Pharmacia) preloaded with nickel chloride (0.1 M, 1 ml) and 
equilibrated in the Tris buffer (50 mM, pH 8.0) containing 0.5 M sodium chloride. 

25 After washing the column with the buffer, the bound protein was eluted with a Tris 
buffer (50 mM, pH 8.0) containing 0.5 M imidazole. The His«tag portion was 
cleaved off, when required, by treating the fusion protein with thrombin using the 
protocol supplied by Novagen (Madison, WI). Fn3 was separated from the His«tag 
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peptide and thrombin by a Resources®column (Pharmacia) using a linear gradient of 
sodium chloride (0 - 0.5 M) in sodium acetate buffer (20 mM, pH 5.0). 

Small amounts of soluble monobodies were prepared as follows. XL-1 Blue 
cells containing pAS38 derivatives (plasmids coding Fn3-pffl fusion proteins) were 

5 grown in LB media at 37 °C with vigorous shaking until OD(600 nm) reached 
approximately 1.0; IPTG was added to the culture to a final concentration of 1 mM, 
and the cells were further grown overnight at 37 °C. Cells were removed from the 
medium by centrifugation, and the supernatant was applied to a microtiter well 
coated with a ligand. Although XL-1 Blue cells containing pAS38 and its 

10 derivatives express FN3-pIH fusion proteins, soluble proteins are also produced due 
to the cleavage of the linker between the Fn3 and pin regions by proteolytic 
activities of E. coli (Rosenblum & Barbas, 1995). Binding of a monobody to the 
ligand was examined by the standard ELISA protocol using a custom antibody 
against Fn3 (purchased from Cocalico Biologicals, Reamstown, PA). Soluble 

1 5 monobodies obtained from the periplasmic fraction of E. coli cells using a standard 
osmotic shock method were also used. 

EXAMPLE X 
Ubiquitin binding monobody 

20 Ubiquitin is a small (76 residue) protein involved in the degradation pathway 

in eurkaryotes. It is a single domain globular protein. Yeast ubiquitin was 
purchased from Sigma Chemical Company and was used without further 
purification. 

Libraries 2 and 4, described in Example VI above, were used to select 
25 ubiquitin-binding monobodies. Ubiquitin (1 ug in 50 ul sodium bicarbonate buffer 
(100 mM, pH 8.5)) was immobilized in the wells of a microtiter plate, followed by 
blocking with BS A (3% in TBS). Panning was performed as described above. In 
the first two rounds, 1 ug of ubiquitin was immobilized per well, and bound phage 
were elute with an acidic solution. From the third to the sixth rounds, 0.1 ug of 

42 



ubiquitin was immobilized per well and the phage were eluted either with an acidic 
solution or with TBS containing 10 \xM ubiquitin. 

Binding of selected clones was tested first in the polyclonal mode, i.e., 
before isolating individual clones. Selected clones from all libraries showed 
5 significant binding to ubiquitin. These results are shown in Figure 9. The binding 
to the immobilized ubiquitin of the clones was inhibited almost completely by less 
than 30 \iM soluble ubiquitin in the competition ELISA experiments (see Fig. 10). 
The sequences of the BC and FG loops of ubiquitin-binding monobodies is shown in 
Table 3. 

10 

Table 3* Sequences of ubiquitin-binding monobodies 



Occurrence Tif 

Name BC loop FG loop more than one) 

211 CARRA RWIPLAK 2 

15 212 CWRRA RWVGLAW 

213 CKHRR FADLWWR 

214 CRRGR RGFMWLS 

215 CNWRR RAYRYRW 

411 SRLRR PPWRV 9 

20 422 ARWTL RRWWW 

424 GQRTF RRWWA 



The 41 1 clone, which was the most enriched clone, was characterized using 
phage ELISA. The 411 clone showed selective binding and inhibition of binding in 
25 the presence of about 10 \xM ubiquitin in solution (Fig. 1 1). 

EXAMPLE XI 
Methods for the immobilization of small molecules 

Target molecules were immobilized in wells of a microtiter plate (Maxisorp, 
30 Nunc) as described hereinbelow, and the wells were blocked with BSA. In addition 
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to the use of carrier protein as described below, a conjugate of a target molecule in 
biotin can be made. The biotinylated ligand can then be immobilized to a microtiter 
plate well which has been coated with streptavidin. 

In addition to the use of a carrier protein as described below, one could make 
a conjugate of a target molecule and biotin (Pierce) and immobilize a biotinylated 
ligand to a microtiter plate well which has been coated with streptavidin (Smith and 
Scott, 1993). 

Small molecules may be conjugated with a carrier protein such as bovine 
serum albumin (BSA, Sigma), and passively adsorbed to the microtiter plate well. 
Alternatively, methods of chemical conjugation can also be used. In addition, solid 
supports other than microtiter plates can readily be employed. 

EXAMPLE XII 
Fluorescein binding monobody 

Fluorescein has been used as a target for the selection of antibodies from 
combinatorial libraries (Barbas, et al 1992). NHS -fluorescein was obtained from 
Pierce and used according to the manufacturer's instructions in preparing conjugates 
with BSA (Sigma). Two types of fluorescein-BSA conjugates were prepared with 
approximate molar ratios of 17 (fluorescein) to one (BSA). 

The selection process was repeated 5-6 times to concentrate positive clones. 
In this experiment, the phage library was incubated with a protein mixture (BSA, 
cytochrome C (Sigma, Horse) and RNaseA (Sigma, Bovine), 1 mg/ml each) at room 
temperature for 30 minutes, prior to the addition to ligand coated wells. Bound 
phage were eluted in TBS containing 10 |iM soluble fluorescein, instead of acid 
elution. After the final round, individual clones were picked and their binding 
affinities (see below) and DNA sequences were determined. 
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Table 4. Clones from Library #2 



10 



15 





BC 


EG 


WT 


AVTVR 


RGDSPAS 


pLB24.1 


CNWRR 


RAYRYRW 


pLB24.2 


CMWRA 


RWGMLRR 


pLB24.3 


ARMRE 


RWLRGRY 


pLB24.4 


CARRR 


RRAGWGW 


pLB24.5 


CNWRR 


RAYRYRW 


pLB24.6 


RWRER 


RHPWTER 


pLB24.7 


CNWRR 


RAYRYRW 


pLB24.8 


ERR VP 


RLLLWQR 


pLB24.9 


GRGAG 


FGSFERR 


pLB24.11 


CRWTR 


RRWFDGA 


pLB24.12 


CNWRR 


RAYRYRW 



Clones from Library #4 



20 



25 



30 



WT 


AVTVR 


GRGDS 


pLB25.1 


GQRTF 


RRWWA 


pLB25.2 


GQRTF 


RRWWA 


pLB25.3 


GQRTF 


RRWWA 


pLB25.4 


LRYRS 


GWRWR 


pLB25.5 


GQRTF 


RRWWA 


pLB25.6 


GQRTF 


RRWWA 


pLB25.7 


LRYRS 


GWRWR 


pLB25.9 


LRYRS 


GWRWR 


pLB25.11 


GQRTF 


RRWWA 


pLB25.12 


LRYRS 


GWRWR 
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Preliminary characterization of the binding affinities of selected clones were 
performed using phage ELISA and competition phage ELISA (see Fig. 12 
(Fluorescein- 1) and Fig. 13 (Fluorescein-2)). The four clones tested showed specific 
binding to the Iigand-coated wells, and the binding reactions are inhibited by soluble 
5 fluorescein (see Fig. 13). 

EXAMPLE XIII 
Digoxigenin binding monobody 

Digoxigenin-3-O-methyl-carbonyl-e-aminocapronic acid-NHS (Boehringer 
10 Mannheim) is used to prepare a digoxigenin-BSA conjugate. The coupling reaction 
is performed following the manufacturers' instructions. The digoxigenin-BSA 
conjugate is immobilized in the wells of a microtiter plate and used for panning. 
Panning is repeated 5 to 6 times to enrich binding clones. Because digoxigenin is 
sparingly soluble in aqueous solution, bound phages are eluted from the well using 
15 acidic solution. See Example XIV. 

EXAMPLE XIV 
TSAC (transition state analog compound) binding monobodies 
Carbonate hydrolyzing monobodies are selected as follows. A transition 
20 state analog for carbonate hydrolysis, 4-nitrophenyl phosphonate is synthesized by 
an Arbuzov reaction as described previously (Jacobs and Schultz, 1987). The 
phosphonate is then coupled to the carrier protein, BS A, using carbodiimide, 
followed by exhaustive dialysis (Jacobs and Schultz, 1987). The hapten-BS A 
conjugate is immobilized in the wells of a microtiter plate and monobody selection 
25 is performed as described above. Catalytic activities of selected monobodies are 
tested using 4-nitrophenyl carbonate as the substrate. 

Other haptens useful to produce catalytic monobodies are summarized in H. 
Suzuki (1994) and in N. R. Thomas (1994). 
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EXAMPLE XV 
NMR characterization of Fn3 and comparison of the Fn3 
secreted by yeast with that secreted by E. coli 
Nuclear magnetic resonance (NMR) experiments are performed to identify 

5 the contact surface between FnAb and a target molecule, e.g., monobodies to 
fluorescein, ubiquitin, RNaseA and soluble derivatives of digoxigenin. The 
information is then be used to improve the affinity and specificity of the monobody. 
Purified monobody samples are dissolved in an appropriate buffer for NMR 
spectroscopy using Amicon ultrafiltration cell with a YM-3 membrane. Buffers are 

10 made with 90 % H 2 O/10 % D 2 0 (distilled grade, Isotec) or with 100 % D 2 0. 

Deuterated compounds (e.g. acetate) are used to eliminate strong signals from them. 

NMR experiments are performed on a Varian Unity INOVA 600 
spectrometer equipped with four RF channels and a triple resonance probe with 
pulsed field gradient capability. NMR spectra are analyzed using processing 

15 programs such as Felix (Molecular Simulations), nmrPipe, PEPP, and CAPP 
(Garrett, etal, 1991; Delaglio, et aL, 1995) on UNIX workstations. Sequence 
specific resonance assignments are made using well-established strategy using a set 
of triple resonance experiments (CBCA(CO)NH and HNCACB) (Grzesiek & Bax, 
1992; Wittenkind & Mueller, 1993). 

20 Nuclear Overhauser effect (NOE) is observed between l H nuclei closer than 

approximately 5 A, which allows one to obtain information on interproton distances. 
A series of double- and triple-resonance experiments (Table 5; for recent reviews on 
these techniques, see Bax & Grzesiek, 1993 and Kay, 1995) are performed to collect 
distance (i.e. NOE) and dihedral angle (J-coupIing) constraints. Isotope-filtered 

25 experiments are performed to determine resonance assignments of the bound ligand 
and to obtain distance constraints within the ligand and those between FnAb and the 
ligand. Details of sequence specific resonance assignments and NOE peak 
assignments have been described in detail elsewhere (Clore & Gronenborn, 1991; 
Pascal, etaL, 1994b; Metzler, et a/., 1996). 



47 



Table 5. NMR experiments for structure characterization 



Experiment Name Reference 

1. reference spectra 

5 

2D-'H, IS N-HSQC (Bodenhausen & Ruben, 1980; Kay, et al, 1992) 

2D-'H, ,3 C-HSQC (Bodenhausen & Ruben, 1980; Vuister & Bax, 1992) 

2. backbone and side chain resonance assignments of l3 C/ l5 N-labeIed protein 

10 

3D-CBCA(CO)NH 
3D-HNCACB 
3D-C(CO)NH 
3D-H(CCO)NH 
15 3 D-HBHA(CBCACO)NH 
3D-HCCH-TOCSY 
3D-HCCH-COSY 
3D-*H, 15 N-TOCSY-HSQC 
2D-HB(CBCDCE)HE 

20 

3. resonance assignments of unlabeled ligand 

2D-isotope-filtered 'H-TOCSY 
2D-isotope-filtered 'H-COSY 
25 2D-isotope-filtered l H-NOESY (Ikura & Bax, 1992) 

4. structural constraints 
within labeled protein 

3D-'H, 15 N-NOESY-HSQC (Zhang et al., 1994) 

30 4D-'H, l3 C-HMQC-NOESY-HMQC (Vuister era/., 1993) 

4D- 1 H, 13 C, ,5 N-HSQC-NOESY-HSQC (Muhandiram etal, 1993; Pascal etal, 1994a) 



(Grzesiek & Bax, 1992) 

(Wittenkind & Mueller, 1993) 

(Logan et al, 1992; Grzesiek et al., 1993) 

(Grzesiek & Bax, 1993) 
(Kay etal., 1993) 
(Ikura era/., 1991) 
(Zhang etal., 1994) 
(Yamazaki etal, 1993) 
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within unlabeled ligand 

2D-isotope-filtered l H-NOESY (Dcura & Bax, 1992) 

interactions between protein and ligand 
3D-isotope-filtered 1 H, l5 N-NOESY-HSQC 
5 3D-isotope-filtered l H, ,3 C-NOESY-HSQC (Lee et al, 1994) 

5. dihedral angle constraints 

J-moIulated 1 H, l5 N-HSQC (Billeter et ai, 1992) 

10 3D-HNHB (Archer et al y 1991) 



Backbone 1 H, 15 N and 13 C resonance assignments for a monobody are 
compared to those for wild-type Fn3 to assess structural changes in the mutant. 

1 5 Once these data establish that the mutant retains the global structure, structural 
refinement is performed using experimental NOE data. Because the structural 
difference of a monobody is expected to be minor, the wild-type structure can be 
used as the initial model after modifying the amino acid sequence. The mutations 
are introduced to the wild-type structure by interactive molecular modeling, and then 

20 the structure is energy-minimized using a molecular modeling program such as 
Quanta (Molecular Simulations). Solution structure is refined using cycles of 
dynamical simulated annealing (Nilges et aL t 1988) in the program X-PLOR 
(Briinger, 1992). Typically, an ensemble of fifty structures is calculated. The 
validity of the refined structures is confirmed by calculating a fewer number of 

25 structures from randomly generated initial structures in X-PLOR using the YAS AP 
protocol (Nilges, et al, 1991). Structure of a monobody-ligand complex is 
calculated by first refining both components individually using intramolecular 
NOEs, and then docking the two using intermolecular NOEs. 

For example, the l H, 15 N-HSQC spectrum for the fluorescein-binding 

30 monobody LB25.5 is shown in Figure 14. The spectrum shows a good dispersion 
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(peaks are spread out) indicating that LB25.5 is folded into a globular conformation. 
Further, the spectrum resembles that for the wild-type Fn3, showing that the overall 
structure of LB25.5 is similar to that of Fn3. These results demonstrate that ligand- 
binding monobodies can be obtained without changing the global fold of the Fn3 
5 scaffold. 

Chemical shift perturbation experiments are performed by forming the 
complex between an isotope-labeled FnAb and an unlabeled ligand. The formation 
of a stoichiometric complex is followed by recording the HSQC spectrum. Because 
chemical shift is extremely sensitive to nuclear environment, formation of a 

10 complex usually results in substantial chemical shift changes for resonances of 
amino acid residues in the interface. Isotope-edited NMR experiments (2D HSQC 
and 3D CBCA(CO)NH) are used to identify the resonances that are perturbed in the 
labeled component of the complex; i.e. the monobody. Although the possibility of 
artifacts due to long-range conformational changes must always be considered, 

15 substantial differences for residues clustered on continuous surfaces are most likely 
to arise from direct contacts (Chen et ai, 1993; Gronenborn & Clore, 1993). 

An alternative method for mapping the interaction surface utilizes amide 
hydrogen exchange (HX) measurements. HX rates for each amide proton are 
measured for l5 N labeled monobody both free and complexed with a ligand. Ligand 

20 binding is expected to result in decreased amide HX rates for monobody residues in 
the interface between the two proteins, thus identifying the binding surface. HX 
rates for monobodies in the complex are measured by allowing HX to occur for a 
variable time following transfer of the complex to D 2 0; the complex is dissociated 
by lowering pH and the HSQC spectrum is recorded at low pH where amide HX is 

25 slow. Fn3 is stable and soluble at low pH, satisfying the prerequisite for the 
experiments. 



EXAMPLE XVI 

Construction and Analysis of Fn3-Display System Specific for Ubiquitin 
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An Fn3-display system was designed and synthesized, ubiquitin-binding 
clones were isolated and a major Fn3 mutant in these clones was biophysically 
characterized. 

Gene construction and phage display of Fn3 was performed as in Examples I 
5 and II above. The Fn3-phage plU fusion protein was expressed from a phagemid- 
display vector, while the other components of the M13 phage, including the wild- 
type pill, were produced using a helper phage (Bass et al., 1990). Thus, a phage 
produced by this system should contain less than one copy of Fn3 displayed on the 
surface. The surface display of Fn3 on the phage was detected by ELISA using an 
10 anti-Fn3 antibody. Only phages containing the Fn3-pin fusion vector reacted with 
the antibody. 

After confirming the phage surface to display Fn3, a phage display library of 
Fn3 was constructed as in Example EL Random sequences were introduced in the 
BC and FG loops. In the first library, five residues (77-8 1) were randomized and 

1 5 three residues (82-84) were deleted from the FG loop. The deletion was intended to 
reduce the flexibility and improve the binding affinity of the FG loop. Five residues 
(26-30) were also randomized in the BC loop in order to provide a larger contact 
surface with the target molecule. Thus, the resulting library contains five 
randomized residues in each of the BC and FG loops (Table 6). This library 

20 contained approximately 10 8 independent clones. 

TJhrary Screening 

Library screening was performed using ubiquitin as the target molecule. In 
each round of panning, Fn3-phages were absorbed to a ubiquitin-coated surface, and 
25 bound phages were eluted competitively with soluble ubiquitin. The recovery ratio 
improved from 4.3 x 10~ 7 in the second round to 4.5 x 10* in the fifth round, 
suggesting an enrichment of binding clones. After five founds of panning, the 
amino acid sequences of individual clones were determined (Table 6). 
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Table 6. Sequences in the variegated loops of enriched clones 



Name 


BC loop 


FG loop 


Frequency 


Wild Type 


GCAGTTACCGTGCGT 


GGCCGTGGTGACAGCCCAGCGAGC 


- 




AlaValThrValArg 


GlyArgGlyAspSerProAlaSer 




Library 51 


NNKNNKNNKNNKNNK 


NNKNNKNNKNNKNNK 


- 




X X X X X 


X X X X X (deletion) 




clone 1 


TCGAGGTTGCGGCGG 


CCGCCGTGGAGGGTG 


9 


(Ubi4) 


SerArgLeuArgArg 


ProProTrpArgVal 




clone2 


GGTCAGCGAACTTTT 


AGGCGGTGGTGGGCT 


1 




GlyGlnArgThrPhe 


ArgArgTrpTrpAla 




cione3 


GCGAGGTGGACGCTT 


AGGCGGTGGTGGTGG 


i 




AlaArgTrpThrLeu 


ArgArgTrpTrpTrp 





10 1 N denotes an equimolar mixture of A, T, G and C; K denotes an equimolar mixture of G and T. 

A clone, dubbed Ubi4, dominated the enriched pool of Fn3 variants. Therefore, 
further investigation was focused on this Ubi4 clone. Ubi4 contains four mutations 
in the BC loop (Arg 30 in the BC loop was conserved) and five mutations and three 

15 deletions in the FG loop. Thus 13% (12 out of 94) of the residues were altered in 
Ubi4 from the wild-type sequence. 

Figure 15 shows a phage ELIS A analysis of Ubi4. The Ubi4 phage binds to 
the target molecule, ubiquitin, with a significant affinity, while a phage displaying 
the wild-type Fn3 domain or a phase with no displayed molecules show little 

20 detectable binding to ubiquitin (Figure 15a). In addition, the Ubi4 phage showed a 
somewhat elevated level of background binding to the control surface lacking the 
ubiquitin coating. A competition ELISA experiments shows the IC^ (concentration 
of the free ligand which causes 50% inhibition of binding) of the binding reaction is 
approximately 5 \iM (Fig. 15b). BSA, bovine ribonuclease A and cytochrome C 

25 show little inhibition of the Ubi4-ubiquitin binding reaction (Figure 15c), indicating 
that the binding reaction of Ubi4 to ubiquitin does result from specific binding. 
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Characterization of a Mutant Fn3 Protein 

The expression system yielded 50-100 mg Fn3 protein per liter culture, A 
similar level of protein expression was observed for the Ubi4 clone and other mutant 
Fn3 proteins. 

5 Ubi4-Fn3 was expressed as an independent protein. Though a majority of 

Ubi4 was expressed in E. coli as a soluble protein, its solubility was found to be 
significantly reduced as compared to that of wild-type Fn3. Ubi4 was soluble up to 
-20 |iM at low pH, with much lower solubility at neutral pH. This solubility was 
not high enough for detailed structural characterization using NMR spectroscopy or 

1 0 X-ray crystallography. 

The solubility of the Ubi4 protein was improved by adding a solubility tail, 
GKKGK, as a C-terminal extension. The gene for Ubi4-Fn3 was subcloned into the 
expression vector pAS45 using PCR. The C-terminal solubilization tag, GKKGK, 
was incorporated in this step. E. coli BL21 (DE3) (Novagen) was transformed with 

15 the expression vector (pAS45 and its derivatives). Cells were grown in M9 minimal 
media and M9 media supplemented with Bactotryptone (Difco) containing 
ampicillin (200 ng/ml). For isotopic labeling, I5 N NH 4 C1 replaced unlabeled NH 4 C1 
in the media. 500 ml medium in a 2 liter baffle flask was inoculated with 10 ml of 
overnight culture and agitated at 37°C. IPTG was added at a final concentration of 1 

20 mM to initiate protein expression when OD (600 nm) reaches one. The cells were 
harvested by centrifugation 3 hours after the addition of IPTG and kept frozen at - 
70°C until used. 

Proteins were purified as follows. Cells were suspended in 5 ml/(g cell) of 
Tris (50 mM, pH 7.6) containing phenylmethylsulfonyl fluoride (1 mM). Hen egg 
25 lysozyme (Sigma) was added to a final concentration of 0.5 mg/mL After incubating 
the solution for 30 minutes at 37°C, it was sonicated three times for 30 seconds on 
ice. Cell debris was removed by centrifugation. Concentrated sodium chloride was 
added to the solution to a final concentration of 0.5 M. The solution was applied to 
a Hi-Trap chelating column (Pharmacia) preloaded with nickel and equilibrated in 
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the Tris buffer containing sodium chloride (0.5 M). After washing the column with 
the buffer, histag-Fn3 was eluted with the buffer containing 500 mM imidazole. 
The protein was further purified using a Resources column (Pharmacia) with a NaCl 
gradient in a sodium acetate buffer (20 mM, pH 4.6). 
5 With the GKKGK tail, the solubility of the Ubi4 protein was increased to 

over 1 mM at low pH and up to -50 \xM at neutral pH. Therefore, further analyses 
were performed on Ubi4 with this C-terminal extension (hereafter referred to as 
Ubi4-K). It has been reported that the solubility of a minibody could be 
significantly improved by addition of three Lys residues at the N- or C-termini 

10 (Bianchi et al., 1994). In the case of protein Rop, a non-structured C-terminal tail is 
critical in maintaining its solubility (Smith et al., 1995). 

Oligomerization states of the Ubi4 protein were determined using a size 
exclusion column. The wild-type Fn3 protein was monomeric at low and neutral 
pH's. However, the peak of the Ubi4-K protein was significantly broader than that 

15 of wild-type Fn3, and eluted after the wild-type protein. This suggests interactions 
between Ubi4-K and the column material, precluding the use of size exclusion 
chromatography to determine the oligomerization state of Ubi4. NMR studies 
suggest that the protein is monomeric at low pH. 

The Ubi4-K protein retained a binding affinity to ubiquitin as judged by 

20 ELISA (Figure 15d). However, an attempt to determine the dissociation constant 
using a biosensor (Affinity Sensors, Cambridge, U.K.) failed because of high 
background binding of Ubi4-K-Fn3 to the sensor matrix. This matrix mainly 
consists of dextran, consistent with our observation that interactions between Ubi4- 
K interacts with the cross-linked dextran of the size exclusion column. 

25 

Example XVII 
Stability Measurements of Monobodies 

Guanidine hydrochloride (GuHCl)-induced unfolding and refolding reactions 
were followed by measuring tryptophan fluorescence. Experiments were performed 
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on a Spectronic AB-2 spcctrofluorometer equipped with a motor-driven syringe 
(Hamilton Co.). The cuvette temperature was kept at 30°C. The spectrofluorometer 
and the syringe were controlled by a single computer using a home-built interface. 
This system automatically records a series of spectra following GuHCl titration. An 
5 experiment started with a 1 .5 ml buffer solution containing 5 \xM protein. An 
emission spectrum (300-400 nm; excitation at 290 nm) was recorded following a 
delay (3-5 minutes) after each injection (50 or 100 fil) of a buffer solution 
containing GuHCl. These steps were repeated until the solution volume reached the 
full capacity of a cuvette (3,0 ml). Fluorescence intensities were normalized as 

10 ratios to the intensity at an isofluorescent point which was determined in separate 
experiments. Unfolding curves were fitted with a two-state model using a nonlinear 
least-squares routine (Santoro & Bolen, 1988). No significant differences were 
observed between experiments with delay times (between an injection and the start 
of spectrum acquisition) of 2 minutes and 10 minutes, indicating that the 

15 unfolding/refolding reactions reached close to an equilibrium at each concentration 
point within the delay times used. 

Conformational stability of Ubi4-K was measured using above-described 
GuHCl-induced unfolding method. The measurements were performed under two 
sets of conditions; first at pH 3.3 in the presence of 300 mM sodium chloride, where 

20 Ubi4-K is highly soluble, and second in TBS, which was used for library screening. 
Under both conditions, the unfolding reaction was reversible, and we detected no 
signs of aggregation or irreversible unfolding. Figure 16 shows unfolding 
transitions of Ubi4-K and wild-type Fn3 with the N-terminal (his) 6 tag and the C- 
terminal solubility tag. The stability of wild-type Fn3 was not significantly affected 

25 by the addition of these tags. Parameters characterizing the unfolding transitions are 
listed in Table 7. 



30 
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Table 7. Stability parameters for Ubi4 and wild-type Fn3 as determined by GuHCl- 
induced unfolding 



Protein AG 0 (kcal mol 1 ) m G (kcal mol" 1 M' 1 ) 

5 Ubi4(pH7.5) 4.8 ±0.1 2.12 ±0.04 
Ubi4 (pH 3.3) 6.5 ± 0. 1 2.07 ± 0.02 

Wild-type (pH 7.5) 7.2 ± 0.2 1 .60 ± 0.04 

Wild-type (pH 3.3) 11,2 ±0,1 2.03 ±0.02 

10 AG 0 is the free energy of unfolding in the absence of denaturant; m^ is the 

dependence of the free energy of unfolding on GuHCl concentration. For solution 
conditions, see Figure 4 caption. 

Though the introduced mutations in the two loops certainly decreased the stability of 
15 Ubi4-K relative to wild-type Fn3, the stability of Ubi4 remains comparable to that of 

a "typical" globular protein. It should also be noted that the stabilities of the wild- 
J type and Ubi4-K proteins were higher at pH 3.3 than at pH 7.5. 

y The Ubi4 protein had a significantly reduced solubility as compared to that 

*y of wild-type Fn3, but the solubility was improved by the addition of a solubility tail 

•Si? ' 

P 20 Since the two mutated loops comprise the only differences between the wild-type 

■J and Ubi4 proteins, these loops must be the origin of the reduced solubility. At this 

point, it is not clear whether the aggregation of Ubi4-K is caused by interactions 
between the loops, or by interactions between the loops and the invariable regions of 
n the Fn3 scaffold. 

25 The Ubi4-K protein retained the global fold of Fn3, showing that this 

scaffold can accommodate a large number of mutations in the two loops tested. 
Though the stability of the Ubi4-K protein is significantly lower than that of the 
wild-type Fn3 protein, the Ubi4 protein still has a conformational stability 
comparable to those for small globular proteins. The use of a highly stable domain 
30 as a scaffold is clearly advantageous for introducing mutations without affecting the 
global fold of the scaffold. In addition, the GuHCl-induced unfolding of the Ubi4 
protein is almost completely reversible. This allows the preparation of a correctly 
folded protein even when a Fn3 mutant is expressed in a misfolded form, as in 



inclusion bodies. The modest stability of Ubi4 in the conditions used for library 
screening indicates that Fn3 variants are folded on the phage surface. This suggests 
that a Fn3 clone is selected by its binding affinity in the folded form, not in a 
denatured form. Dickinson et aL proposed that Val 29 and Arg 30 in the BC loop 
5 stabilize Fn3. Val 29 makes contact with the hydrophobic core, and Arg 30 forms 
hydrogen bonds with Gly 52 and Val 75. In Ubi4-Fn3, Val 29 is replaced with Arg, 
while Arg 30 is conserved. The FG loop was also mutated in the library. This loop 
is flexible in the wild-type structure, and shows a large variation in length among 
human Fn3 domains (Main et al., 1992). These observations suggest that mutations 
10 in the FG loop may have less impact on stability. In addition, the N-terminal tail of 
Fn3 is adjacent to the molecular surface formed by the BC and FG loops (Figure 1 
and 17) and does not form a well-defined structure. Mutations in the N-terminal tail 
would not be expected to have strong detrimental effects on stability. Thus, residues 
in the N-terminal tail may be good sites for introducing additional mutations. 

15 

Example XVIII 
NMR Spectroscopy of Ubi4-Fn3 

Ubi4-Fn3 was dissolved in [ 2 H]-Gly HC1 buffer (20 mM, pH 3.3) containing 
NaCl (300 mM) using an Amicon ultrafiltration unit. The final protein 

20 concentration was 1 mM. NMR experiments were performed on a Varian Unity 
INOVA 600 spectrometer equipped with a triple-resonance probe with pulsed field 
gradient The probe temperature was set at 30°C HSQC, TOCS Y-HSQC and 
NOES Y-HSQC spectra were recorded using published procedures (Kay et al., 1992; 
Zhang et al., 1994). NMR spectra were processed and analyzed using the NMRPipe 

25 and NMRView software (Johnson & Blevins, 1994; Delaglio et al., 1995) on UNIX 
workstations. Sequence-specific resonance assignments were made using standard 
procedures (Wuthrich, 1986; Clore & Gronenborn, 1991). The assignments for 
wild-type Fn3 (Baron et al., 1992) were confirmed using a 15 N-labeIed protein 
dissolved in sodium acetate buffer (50 mM, pH 4.6) at 30°C. 
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The three-dimensional structure of Ubi4-K was characterized using this 
heteronuclear NMR spectroscopy method. A high quality spectrum could be 
collected on a 1 mM solution of 15 N-labeled Ubi4 (Figure 17a) at low pH. The 
linewidth of amide peaks of Ubi4-K was similar to that of wild-type Fn3, suggesting 

5 that Ubi4-K is monomelic under the conditions used. Complete assignments for 
backbone l H and 15 N nuclei were achieved using standard *H, 15 N double resonance 
techniques, except for a row of His residues in the N-terminal (His) 6 tag. There 
were a few weak peaks in the HSQC spectrum which appeared to originate from a 
minor species containing the N-terminal Met residue. Mass spectroscopy analysis 

10 showed that a majority of Ubi4-K does not contain the N-terminal Met residue. Fig. 
17 shows differences in i HN and 15 N chemical shifts between Ubi4-K and wild-type 
Fn3. Only small differences are observed in the chemical shifts, except for those in 
and near the mutated BC and FG loops. These results clearly indicate that Ubi4-K 
retains the global fold of Fn3, despite the extensive mutations in the two loops. A 

15 few residues in the N-terminal region, which is close to the two mutated loops, also 
exhibit significant chemical differences between the two proteins. An HSQC 
spectrum was also recorded on a 50 [iM sample of Ubi4-K in TBS. The spectrum 
was similar to that collected at low pH, indicating that the global conformation of 
Ubi4 is maintained between pH 7.5 and 3.3. 

20 

The complete disclosure of all patents, patent documents and publications 
cited herein are incorporated by reference as if individually incorporated. The 
foregoing detailed description and examples have been given for clarity of 
understanding only. No unnecessary limitations are to be understood therefrom. 
25 The invention is not limited to the exact details shown and described for variations 
obvious to one skilled in the art will be included within the invention defined by the 
claims. 
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WHAT IS CLAIMED IS: 



1 . A fibronectin type HI (Fn3) polypeptide monobody comprising a plurality of 
Fn3 p-strand domain sequences that are linked to a plurality of loop region 
sequences, 

wherein one or more of the monobody loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from the 
corresponding loop region sequences in wild-type Fn3, and 

wherein the P-strand domains of the monobody have at least a 50% 
total amino acid sequence homology to the corresponding amino acid 
sequence of wild-type Fn3's p-strand domain sequences. 

2. The monobody of claim 1, wherein at least one loop region is capable of 
binding to a specific binding partner (SBP) to form a polypeptide:SBP 
complex having a dissociation constant of less than 10" 6 moles/liter. 

3. The monobody of claim 1, wherein at least one loop region is capable of 
catalyzing a chemical reaction with a catalyzed rate constant (k^) and an 
uncatalyzed rate constant (k^J such that the ratio of k^/k^ is greater than 
10. 

4. The monobody of claim 1, wherein one or more of the loop regions comprise 
amino acid residues: 

i) from 15 to 16 inclusive in an AB loop; 

ii) from 22 to 30 inclusive in a BC loop; 

iii) from 39 to 45 inclusive in a CD loop; 

iv) from 5 1 to 55 inclusive in a DE loop; 

v) from 60 to 66 inclusive in an EF loop; and 

vi) from 76 to 87 inclusive in an FG loop. 
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The monobody of claim 1, wherein the monobody loop region sequences 
vary from the wild-type Fn3 loop region sequences by the deletion or 
replacement of at least 2 amino acids. 

The monobody of claim 1, wherein the monobody loop region sequences 
vary from the wild-type Fn3 loop region sequences by the insertion of from 3 
to 25 amino acids. 

An isolated nucleic acid molecule encoding the polypeptide monobody of 
claim 1. 

An expression vector comprising an expression cassette operably linked to 
the nucleic acid molecule of claim 7. 

The expression vector of claim 8, wherein the expression vector is an Ml 3 
phage-based plasmid. 

A host cell comprising the vector of claim 8. 

A method of preparing a fibronectin type HI (Fn3) polypeptide monobody 
comprising the steps of: 

a) providing a DNA sequence encoding a plurality of Fn3 p-strand 
domain sequences that are linked to a plurality of loop region 
sequences wherein at least one loop region contains a unique 
restriction enzyme site; 

b) cleaving the DNA sequence at the unique restriction site; 

c) inserting into the restriction site a DNA segment known to encode a 
peptide capable of binding to a specific binding partner (SBP) or a 
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transition state analog compound (TS AC) so as to yield a DNA 
molecule comprising the insertion and the DNA sequence of (a); and 
d) expressing the DNA molecule so as to yield polypeptide monobody. 

12. A method of preparing a fibronectin type HI (Fn3) polypeptide monobody 
comprising the steps of: 

(a) providing a replicatable DNA sequence encoding a plurality of Fn3 
P-strand domain sequences that are linked to a plurality of loop 
region sequences, wherein the nucleotide sequence of at least one 
loop region is known; 

(b) preparing polymerase chain reaction (PCR) primers sufficiently 
complementary to the known loop sequence so as to be hybridizable 
under PCR conditions, wherein at least one of the primers contains a 
modified nucleic acid sequence to be inserted into the DNA; 

(c) performing polymerase chain reaction using the DNA sequence of (a) 
and the primers of (b); 

(d) annealing and extending the reaction products of (c) so as to yield a 
DNA product; and 

(e) expressing the polypeptide monobody encoded by the DNA product 
of(d). 

13. A method of preparing a fibronectin type m (Fn3) polypeptide monobody 
comprising the steps of: 

a) providing a replicatable DNA sequence encoding a plurality of Fn3 
p-strand domain sequences that are linked to a plurality of loop 
region sequences, wherein the nucleotide sequence of at least one 
loop region is known; 

b) performing site-directed mutagenesis of at least one loop region so 
as to create a DNA sequence comprising an insertion mutation; and 
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c) expressing the polypeptide monobody encoded by the DNA sequence 
comprising the insertion mutation. 

14. A kit for performing the method of any one of claims 1 1-13, comprising a 
replicatable DNA encoding a plurality of Fn3 P-strand domain sequences 
that are linked to a plurality of loop region sequences. 

15. A variegated nucleic acid library encoding Fn3 polypeptide monobodies 
comprising a plurality of nucleic acid species each comprising a plurality of 
loop regions, wherein the species encode a plurality of Fn3 p-strand domain 
sequences that are linked to a plurality of loop region sequences, 

wherein one or more of the loop region sequences vary by deletion, 
insertion or replacement of at least two amino acids from corresponding loop 
region sequences in wild-type Fn3, and 

wherein the P-strand domain sequences of the monobody have at 
least a 50% total amino acid sequence homology to the corresponding amino 
acid sequences of p-strand domain sequences of the wild-type Fn3. 

16. The variegated nucleic acid library of claim 15, wherein one or more of the 
loop regions encodes: 

i) an AB amino acid loop from residue 15 to 16 inclusive; 

ii) a BC amino acid loop from residue 22 to 30 inclusive; 

iii) a CD amino acid loop from residue 39 to 45 inclusive; 

iv) a DE amino acid loop from residue 51 to 55 inclusive; 

v) an EF amino acid loop from residue 60 to 66 inclusive; and 

vi) an FG amino acid loop from residue 76 to 87 inclusive. 
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17. The variegated nucleic acid library of claim 15, wherein the loop region 
sequences vary from the wild-type Fn3 loop region sequences by the deletion 
or replacement of at least 2 amino acids. 

18. The variegated nucleic acid library of claim 15, wherein the monobody loop 
region sequences vary from the wild-type Fn3 loop region sequences by the 
insertion of from 3 to 25 amino acids. 

19. The variegated nucleic acid library of claim 15, wherein a variegated nucleic 
acid sequence comprising from 6 to 75 nucleic acid bases is inserted in any 
one of the loop regions of said species. 

20. The variegated nucleic acid library of claim 15, wherein the variegated 
sequence is constructed so as to avoid one or more codons selected from the 
group consisting of those codons encoding cysteine or the stop codon. 

21. The variegated nucleic acid library of claim 15, wherein the variegated 
nucleic acid sequence is located in the BC loop. 

22. The variegated nucleic acid library of claim 15, wherein the variegated 
nucleic acid sequence is located in the DE loop. 

23. The variegated nucleic acid library of claim 15, wherein the variegated 
nucleic acid sequence is located in the FG loop. 

24. The variegated nucleic acid library of claim 15, wherein the variegated 
nucleic acid sequence is located in the AB loop. 
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25. The variegated nucleic acid library of claim 15, wherein the variegated 
nucleic acid sequence is located in the CD loop. 

26. The variegated nucleic acid library of claim 15, wherein the variegated 
nucleic acid sequence is located in the EF loop. 

27. A peptide display library derived from the variegated nucleic acid library of 
claim 15. 

28. A peptide display library of claim 27, wherein the peptide is displayed on the 
surface of a bacteriophage or virus. 

29. A peptide display library of claim 28, wherein the bacteriophage is M13 or 
fd. 

30. A method of identifying the amino acid sequence of a polypeptide molecule 
capable of binding to a specific binding partner (SBP) so as to form a 
polypeptide:SSP complex wherein the dissociation constant of the said 
polypeptide:SBP complex is less than 10" 6 moles/liter, comprising the steps 
of: 

a) providing a peptide display library according to claim 28; 

b) contacting the peptide display library of (a) with an immobilized or 
separable SBP; 

c) separating the peptide:SBP complexes from the free peptides, 

d) causing the replication of the separated peptides of (c) so as to result 
in a new peptide display library distinguished from that in (a) by 
having a lowered diversity and by being enriched in displayed 
peptides capable of binding the SBP; 
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e) optionally repeating steps (b), (c), and (d) with the new library of (d); 
and 

f) determining the nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and deducing the peptide 
sequence capable of binding to the SBP. 

A method of preparing a variegated nucleic acid library encoding Fn3 
polypeptide monobodies having a plurality of nucleic acid species each 
comprising a plurality of loop regions, wherein the species encode a plurality 
of Fn3 P-strand domain sequences that are linked to a plurality of loop 
region sequences, wherein one or more of the loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from 
corresponding loop region sequences in wild-type Fn3, and wherein the p- 
strand domain sequences of the monobody have at least a 50% total amino 
acid sequence homology to the corresponding amino acid sequences of P- 
strand domain sequences of the wild-type Fn3, comprising the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence; 

b) contacting the polypeptide with a specific binding partner (SBP) so 
as to form a polypeptide:SSP complex wherein the dissociation 
constant of the said polypeptide:SBP complex is less than 10" 6 
moles/liter; 

c) determining the binding structure of the polypeptide:SBP complex by 
nuclear magnetic resonance spectroscopy or X-ray crystallography; 
and 

d) preparing the variegated nucleic acid library, wherein the variegation 
is performed at positions in the nucleic acid sequence which, from 
the information provided in (c), result in one or more polypeptides 
with improved binding to the SBP. 
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32. A method of identifying the amino acid sequence of a polypeptide molecule 
capable of catalyzing a chemical reaction with a catalyzed rate constant, k cal , 
and an uncatalyzed rate constant, k uncat , such that the ratio of k cat /k uncat is 
greater than 10, comprising the steps of: 

a) providing a peptide display library according to claim 28; 

b) contacting the peptide display library of (a) with an immobilized or 
separable transition state analog compound (TS AC) representing the 
approximate molecular transition state of the chemical reaction; 

c) separating the peptide:TSAC complexes from the free peptides; 

d) causing the replication of the separated peptides of (c) so as to result 
in a new peptide display library distinguished from that in (a) by 
having a lowered diversity and by being enriched in displayed 
peptides capable of binding the TS AC; 

e) optionally repeating steps (b), (c), and (d) with the new library of (d); 
and 

f) determining the nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and hence deducing the 
peptide sequence. 

33. A method of preparing a variegated nucleic acid library encoding Fn3 
polypeptide monobodies having a plurality of nucleic acid species each 
comprising a plurality of loop regions, wherein the species encode a plurality 
of Fn3 P-strand domain sequences that are linked to a plurality of loop 
region sequences, wherein one or more of the loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from 
corresponding loop region sequences in wild-type Fn3, and wherein the p- 
strand domain sequences of the monobody have at least a 50% total amino 
acid sequence homology to the corresponding amino acid sequences of p~ 
strand domain sequences of the wild-type Fn3, comprising the steps of 
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a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence, wherein the polypeptide is capable of catalyzing a 
chemical reaction with a catalyzed rate constant, k cat , and an 
uncatalyzed rate constant, k uncat , such that the ratio of k cat /k uncat is 
greater than 10; 

b) contacting the polypeptide with an immobilized or separable 
transition state analog compound (TS AC) representing the 
approximate molecular transition state of the chemical reaction; 

c) determining the binding structure of the polypeptide:TS AC complex 
by nuclear magnetic resonance spectroscopy or X-ray 
crystallography; and 

d) preparing the variegated nucleic acid library, wherein the variegation 
is performed at positions in the nucleic acid sequence which, from 
the information provided in (c), result in one or more polypeptides 
with improved binding to or stabilization of the TS AC. 

34. An isolated polypeptide identified by the method of claim 30 

35. An isolated polypeptide identified by the method of claim 32. 

36. A kit for identifying the amino acid sequence of a polypeptide molecule 
capable of binding to a specific binding partner (SBP) so as to form a 
polypeptide: SSP complex wherein the dissociation constant of the said 
polypeptide:SBP complex is less than 10" 6 moles/liter, comprising the 
peptide display library of claim 28. 

37. A kit for identifying the amino acid sequence of a polypeptide molecule 
capable of catalyzing a chemical reaction with a catalyzed rate constant, k cat , 
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and an uncatalyzed rate constant, k uncat , such that the ratio of k cat /k U[lcat is 
greater than 10, comprising the peptide display library of claim 28. 

38. A polypepetide derived by using the kit of claim 36. 

39. A polypeptide derived by using the kit of claim 37. 
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ABSTRACT 



A fibronectin type HI (Fn3) polypeptide monobody, a nucleic acid molecule 
encoding said monobody, and a variegated nucleic acid library encoding said 

5 monobody, are provided by the invention. Also provided are methods of preparing a 
Fn3 polypeptide monobody, and kits to perform said methods. Further provided is a 
method of identifying the amino acid sequence of a polypeptide molecule capable of 
binding to a specific binding partner (SBP) so as to form a polypeptide:SSP 
complex, and a method of identifying the amino acid sequence of a polypeptide 

10 molecule capable of catalyzing a chemical reaction with a catalyzed rate constant, 
k cat , and an uncatalyzed rate constant, k^, such that the ratio of k^/k^ is greater 
than 10. 
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firw/organixadon/who/which first sends/sent this case to them and by >*4iom/which I hereby declare that I have consented after full disclosure 
to be represented unJe$$/until I instruct Sehwegman, Lundberg, Woessncr & Kluth, P.A- to the contrary. 

Please direct all correspondence in this case to Schwegman* Luhdbetg, Woessner & Kluth, P*A. at the address indicated below: 

P.O, Box 2938, Minneapolis, MN 5S402 
Telephone No. (612)373-6900 
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I hereby declare that frit statements mad© herein of my own knowledge arc true and (hat ail statements made on information and 
belief are be^cved to be true; and farther that tbe$e statements were made with the knowledge that willful false $tatcments and the like so 
made arc punishable by fine or imprisonment, or both, under Section 1001 of Title J 8 of the United States Code and that such wiilM faise 
Statements may jeopardize the validity of the application or any patent issued thereon. 
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§ 1. 56 Doty to disclose information material to patentability. 

(a) A patent by its very nature is affected with a public interest The public interest is best served, and the most effective parent 
examination occurs when, at the rime an application is being examined, the Office is aware of and evaluates the teachings of ail information 
material to patentability. Each individual associated with the filing and prosecution Of a patent application has a duty of candor and good 
faith in dealing with the Office, which includes a duty to disclose to the Office all information known To that individual to be material to 
patentability as defined in this section* The duty to disclose information exists with respect to each pending claim until the claim is canceled 
or withdrawn from consideration, or the application becomes abandoned. Information material to the patentability of a claim that is canceled 
or withdrawn from consideration need not be submitted if the information is not material to the patentability of any claim remaining under 
consideration in the application. There is no duty to submit information which is not material to the patentability of any existing claim. The 
duty to disclose all information known to be material to patentability is deemed to be satisfied if all information known to be material to 
patentability of any claim issued in a patent was cited by the Office or submitted to the Office in the manner prescribed by §§ 1.97(b)-(d) and 
l.9fc. However, no patent will be granted on an application in connection with which fraud on the Office was practiced or attempted or the 
duty of disclosure was violated through bad faith or intentional misconduct. The Office encourages applicants to carefully examine: 

( 1 ) prior art cited in search reports of a foreign patent office in a counterpart application, and 

(2) the closest information over which individuals associated With the filing or prosecution Q f a patent application believe any 
pending claim patentably defines, to make sure that any material information contained therein is disclosed to the Office. 

(b^L Under this section, information is material to patentability when it is not cumulative to information already of record or being 
madeMf record in the application, and 

(1) It establishes, by itself or to combination with other information, a prima facie case of unpatentability of a claim; or 

\: (2) It refutes, or is inconsistent with, a position the applicant cakes in; 

(i) Opposing an argument of unpatentability relied on by the Office, or 

(ii) Asserting an argument of patentability, 

is 

A pnSa facie case of unpatentability is established when the information compels a conclusion that a claim is unpatentable under the 
preponderance of evidence, burden-of-proof stand ard, giving each term in the claim its broadest reasonable construction consistent with the 
specification, and before any consideration is given to evidence which may be submitted in an attempt to establish a contrary conclusion of 
patepabrUty. 

(g Individuals associated with the filing or prosecution of a patent application within the meaning of this section are: 

( 1 ) Each inventor named in the application: 

(2) Each attorney or agent who prepares or prosecutes tbe application; and 

(3) Every other person who is substantively involved in the preparation of prosecution of the application and who is associated 
with the inventor, with the assignee or with anyone to whom there i$ an obligation to assign the application. 

(d) Individuals other than the attorney, agent or inventor may comply with this section by disclosing information to the attorney, 
agent, or inventor. 



S/N Unknown PATENT 
IN THE TTNTTED STATES PA TENT AND TRADEMARK OFFICE 

Applicant: ShoheiKoide Examiner: Unknown 

Serial No.: Unknown Group Art Unit: Unknown 

Filed: Herewith Docket: 109.034US3 

Title: ARTIFICIAL ANTIBODY POLYPEPTIDES 

rOMMTTNTCATTON TINDER 37 C .F.R. $ 1.821 

BOX PATENT APPLICATION 
Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

This Communication is submitted to conform the above-referenced application to the 
requirements of 37 C.F.R. §§ 1.821 through 1.825. 

The paper copy of the Sequence Listing in this application is identical to the computer 
readable copy of the Sequence Listing filed in application Serial No. 09/096,749, filed June 12, 
1998. In accordance with 37 C.F.R. § 1.821(e), please use the last-filed computer readable form in 
that application as the computer readable form for the instant application. A paper copy of the 
Sequence Listing is included in a separately-filed preliminary amendment for incorporation into the 
specification. 

Respectfully submitted, 

SHOHEI KOIDE 

By his Representatives, 

SCHWEGMAN, LUNDBERG, WOESSNER & KLUTH, P.A. 
P.O. Box 2938 
Minneapolis, MN 55402 
(612) 373-6961 
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Ann S. Viksnins 
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I hereby certify that this paper or fee is being deposited with the United States Postal Service "Express Mail Post Office to Addressee service under 
37 C.F.R. 1 .10 on the date indicated above and is addressed to Box PATENT APPLICATION, Aqpis^t Commissioner for Patents, Washington, 
D.C. 20231. 



Shawn I.. Hise 



Printed Name Signatuf. 



SEQUENCE LISTING 



(1) GENERAL INFORMATION 
(i) APPLICANT: Koide, Shohei 

(ii) TITLE OF THE INVENTION: ARTIFICIAL ANTIBODY POLYPEPTIDES 

(iii) NUMBER OF SEQUENCES: 118 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Schwegman, Lundberg, Woessner & Kluth P. A. 

(B) STREET; 121 South Eighth Street, Ste . 1600 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) COUNTRY: USA 

(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Unknown 

(B) FILING DATE: Herewith 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER : 09/096,749 

(B) FILING DATE: June 12, 1998 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Ann S. Viksnins 

(B) REGISTRATION NUMBER: 37,74 8 

(C) REFERENCE /DOCKET NUMBER: 109.034US3 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (612) 373-6900 

(B) TELEFAX: (612) 339-3061 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



Ala Arg Glu Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly 
15 10 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Ala Arg Gly Ala Val Val Ser Tyr Tyr Ala Met Asp Tyr Trp Gly Gin 

15 10 15 

Gly 



Ift (2) INFORMATION FOR SEQ ID NO : 3 : 

r] (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
^ (B) TYPE: amino acid 

IS (C) STRANDEDNESS: single 

;M (D) TOPOLOGY: linear 

g " (ii) MOLECULE TYPE: peptide 

f*% (iii) HYPOTHETICAL: NO 

S (iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 
f~7 (vi) ORIGINAL SOURCE: 

□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

f'l 

Tyr Ala Val Thr Gly Arg Gly Asp Ser Pro Ala Ser Ser Lys Pro lie 
15 10 15 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



2 



n 

&i7 



Tyr Ala Glu Arg Asp Tyr Arg Leu Asp Tyr Pro lie 
15 10 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Tyr Ala Val Arg Asp Tyr Arg Leu Asp Tyr Pro lie 
15 10 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Tyr Ala Val Arg Asp Tyr Arg Leu Asp Tyr Ala Ser Ser Lys Pro lie 
15 10 15 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
Tyr Ala Val Arg Asp Tyr Arg Leu Asp Tyr Lys Pro lie 
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15 10 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Tyr Ala Val Arg Asp Tyr Arg Ser Lys Pro lie 
15 10 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Tyr Ala Val Thr Arg Asp Tyr Arg Leu Ser Ser Lys Pro lie 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Tyr Ala Val Thr Glu Arg Asp Tyr Arg Leu Ser Ser Lys Pro lie 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Tyr Ala Val Ala Val Val Ser Tyr Tyr Ala Met Asp Tyr Pro He 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Tyr Ala Val Thr Ala Val Val Ser Tyr Tyr Ala Ser Ser Lys Pro He 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CGGGATCCCA TATGCAGGTT TCTGATGTTC CGCGTGACCT GGAAGTTGTT GCTGCGACC 



(2) INFORMATION FOR SEQ ID NO : 14 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
TAACTGCAGG AGCATCCCAG CTGATCAGCA GGCTAGTCGG GGTCGCAGCA ACAAC 55 
(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTCCTGCAGT TACCGTGCGT TATTACCGTA TCACGTACGG TGAAACCGGT G 51 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTGAATTCCT GAACCGGGGA GTTACCACCG GTTTCACCG 3 9 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
AGGAATTCAC TGTACCTGGT TCCAAGTCTA CTGCTACCAT CAGCGG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTATAGTCGA CACCCGGTTT CAGGCCGCTG ATGGTAGC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGGGTGTCGA CTATAC CATC ACTGTATACG CT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CGGGATCCGA GCTCGCTGGG CTGTCACCAC GGCCAGTAAC AGCGTATACA GTGAT 55 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CAGCGAGCTC CAAGCCAATC TCGATTAACT ACCGT 3 5 

(2) INFORMATION FOR SEQ ID NO: 22; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

CGGGATCCTC GAGTTACTAG GTACGGTAGT TAATCGA 37 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CGGGATCCAC GCGTGCCACC GGTACGGTAG TTAATCGA 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CGGGATCCAC GCGTCCATTC GTTTGTGAAT ATCAAGGCCA ATCG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CCGGAAGCTT TAAGACTCCT TATTACGCAG TATGTTAGC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CTGTTACTGG CCGTGAGATC TAACCAGCGA GCTCCA 



(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
GATCAGCTGG GATGCTCCTN NKNNKNNKNN KNNKTATTAC CGTATCACGT A 51 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TGTATACGCT GTTACTGGCN NKNNKNNKNN KNNKNNKNNK TCCAAGCCAA TCTCGAT 57 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CTGTATACGC TGTTACTGGC NNKNNKNNKN NKCCAGCGAG CTCCAAG 4 7 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 51 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISEMSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CATCACTGTA TACGCTGTTA CTNNKNNKNN KNNKNNKTCC AAGC CAATCT C 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 

Cys Ala Arg Arg Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Arg Trp lie Pro Leu Ala Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

Cys Trp Arg Arg Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

Arg Trp Val Gly Leu Ala Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

Cys Lys His Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Phe Ala Asp Leu Trp Trp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 i ne ar 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Cys Arg Arg Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Arg Gly Phe Met Trp Leu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

Cys Asn Trp Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

Arg Ala Tyr Arg Tyr Arg Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

Ser Arg Leu Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 

Pro Pro Trp Arg Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

Ala Arg Trp Thr Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

Arg Arg Trp Trp Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Gly Gin Arg Thr Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Arg Arg Trp Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Ala Val Thr Val Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 
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(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:48: 

Arg Gly Asp Ser Pro Ala Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Cys Asn Trp Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Arg Ala Tyr Arg Tyr Arg Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 

Cys Met Trp Arg Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

Arg Trp Gly Met Leu Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

Ala Arg Met Arg Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 



18 



Arg Trp Leu Arg Gly Arg Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

Cys Ala Arg Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 

Arg Arg Ala Gly Trp Gly Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 



Cys Asn Trp Arg Arg 



(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 

Arg Ala Tyr Arg Tyr Arg Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

Arg Trp Arg Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

Arg His Pro Trp Thr Glu Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Cys Asn Trp Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 62 : 

Arg Ala Tyr Arg Tyr Arg Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

Glu Arg Arg Val Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 64: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 64 

Arg Leu Leu Leu Trp Gin Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

Gly Arg Gly Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

Phe Gly Ser Phe Glu Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

Cys Arg Trp Thr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

Arg Arg Trp Phe Asp Gly Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 

Cys Asn Trp Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

Arg Ala Tyr Arg Tyr Arg Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 

Ala Val Thr Val Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

Gly Arg Gly Asp Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 

Gly Gin Arg Thr Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

Arg Arg Trp Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

Gly Gin Arg Thr Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 

Arg Arg Trp Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO : 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 

Gly Gin Arg Thr Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

Arg Arg Trp Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

Leu Arg Tyr Arg Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 

Gly Trp Arg Trp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 

Gly Gin Arg Thr Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 

Arg Arg Trp Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : £ 

Gly Gin Arg Thr Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO : 84 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Arg Arg Trp Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Leu Arg Tyr Arg Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 

Gly Trp Arg Trp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

Leu Arg Tyr Arg Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 

Gly Trp Arg Trp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 

Gly Gin Arg Thr Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

Arg Arg Trp Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 
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(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Leu Arg Tyr Arg Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 

Gly Trp Arg Trp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
GCAGTTACCG TGCGT 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 94: 

Ala Val Trp Val Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
GGCCGTGGTG ACAGCCCAGC GAGC 24 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Gly Arg Gly Asp Ser Pro Ala Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 
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TCGAGGTTGC GGCGG 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Ser Arg Leu Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
CCGCCGTGGA GGGTG 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 



Pro Pro Trp Arg Val 
1 5 



(2) INFORMATION FOR SEQ ID NO; 101: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE : 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 
GGTCAGCGAA CTTTT 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 

Gly Gin Arg Trp Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO:103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 
AGGCGGTGGT GGGCT 

(2) INFORMATION FOR SEQ ID NO: 104: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID 



Arg Arg Trp Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO:105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

15 

GCGAGGTGGA CGCTT 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 



Ala Arg Trp Thr Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO : 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
AGGCGGTGGT GGTGG 15 
(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Arg Arg Trp Trp Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Gly Lys Lys Gly Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



Val Ser Asp Val Pro Arg Asp Leu Glu Val Val Ala Ala Thr Pro Thr Ser Leu Leu 

15 10 15 

lie Ser Trp Asp Ala Pro Ala Val Thr Val Arg Tyr Tyr Arg lie Thr Tyr Gly Glu 
20 25 30 35 

Thr Gly Gly Asn Ser Pro Val Gin Glu Phe Thr Val Pro Gly Ser Lys Ser Thr Ala 

40 45 50 55 

Thr lie Ser Gly Leu Lys Pro Gly Val Asp Tyr Thr lie Thr Val Tyr Ala Val Thr 

60 65 70 75 

Gly Arg Gly Asp Ser Pro Ala Ser Ser Lys Pro lie Ser lie Asn Tyr Arg Thr 
80 85 90 

(2) INFORMATION FOR SEQ ID NO: ill: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CATATGCAGG TTTCTGATGT TCCGCGTGAC CTGGAAGTTG TTGCTGCGAC CCCGACTAGC 60 

CTGCTGATCA GCTGGGATGC TCCTGCAGTT ACCGTGCGTT ATTACCGTAT CACGTACGGT 12 0 

GAAACCGGTG GTAACTCCCC GGTTCAGGAA TTCACTGTAC CTGGTTCCAA GTCTACTGCT 180 

ACCATCAGCG GCCTGAAACC GGGTGTCGAC TAT AC CATC A CTGTATACGC TGTTACTGGC 240 

CGTGGTGACA GCCCAGCGAG CTCCAAGCCA ATCTCGATTA ACTACCGTAC CTAGTAACTC 3 00 

GAGGATCC 3 08 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 



Met Gin Val Ser Asp Val Pro 

1 5 
Leu Leu lie Ser Trp Asp Ala 
20 25 
Gly Glu Thr Gly Gly Asn Ser 

40 45 
Thr Ala Thr He Ser Gly Leu 
60 

Val Thr Gly Arg Gly Asp Ser 
80 

Thr 



Arg Asp Leu Glu Val Val 
10 

Pro Ala Val Thr Val Arg 
30 

Pro Val Gin Glu Phe Thr 
50 

Lys Pro Gly Val Asp Tyr 
65 70 
Pro Ala Ser Ser Lys Pro 
85 



Ala Ala Thr Pro Thr Ser 
15 

Tyr Tyr Arg He Thr Tyr 
35 

Val Pro Gly Ser Lys Ser 
55 

Thr He Thr Val Tyr Ala 
75 

He Ser He Asn Tyr Arg 
90 95 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



Arg Gly Asp 
1 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 



Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

15 10 15 

Arg Gly Ser His 
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(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Gly Gin Arg Thr Phe Arg Arg Trp Trp Ala 
15 10 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Leu Arg Tyr Arg Ser Gly Trp Arg Trp Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Cys Asn Trp Arg Arg Arg Ala Tyr Arg Tyr Trp Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 118: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Arg Met Arg Glu Arg Trp Leu Arg Gly Arg Tyr 
5 10 
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